- cross-posted to:
- technology@lemmy.world
- cross-posted to:
- technology@lemmy.world
We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB), grouped by popularity.
This release includes the largest publicly available music metadata database with 256 million tracks and 186 million unique ISRCs.
It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.
After Meta scraped all their books they have the perfect defense now. All they have to say is “we’re training a music AI” and they’re apparently untouchable.
Well, they have to say “we’re training a music AI” while slipping several million dollars into the pockets of the right people. Rich people don’t win legal battles by actually proving what they did isn’t illegal, they do it by discreetly paying people to say they did.
Often and increasingly they are not bothering with the discretion part anymore.
And slip the POTUS a gold phone or a couple thousand bucks and you are g2g
“Honey, all I need is $10,000 for a server and we’ll never pay for Spotify again”
Omg if my girlfriend had $10k to give me for a server I could buy like a RAM or two!
I cannot fathom the legal fees that will be incurred if they release 99.6% of Spotify to the public for free. Holy fucking shit.
Maybe they could say they’re AI. Apparently copyright infringement’s legal then.
Have to find them first
Yeah they’re definitely gonna catch some legal action for this, even tho I’m all for them releasing the data
Almost makes me want to get into torrenting again. But dab.yeet.su, squid.wtf, and doubledouble.top usually have me covered with ddl
Does this make Anna’s Archive the top music tracker now? Lol move over RED
holy fuck

Dns blocked in germany. fun.
Does DoH work? (Secure DNS in firefox’s settings)
Simply choose a private DNS server like mullvad,quad,etc. and it should work…
https://www.privacyguides.org/en/dns/
From the mega thread here
It’s probably blocked by your particular ISP, not every German ISP.
Would be interesting if someone checked what % of that archive is slopified.
Honestly, this is the best time to snapshot it, because even with the slop already there, the exponential increase that’s about to happen will absolutely dwarf what’s there now.
This is such important work o7
I got 10tb I’ll throw at a few slices of this.
Is that ALL off the songs?
more like “backify”. seriously!
Ok, how do we download this?
Step 1: Buy £6,000 worth of identical hard drives and a motherboard with 16 SATA ports. Or £12,000 worth and a RAID 1 server rig. Or £24,000 and RAID 6
6k for 10TB?
A Raspberry pi with a sata hat and 16TB hard disk could download and serve this for well under 500 quid.
An off the shelf 4 bay NAS with 4x 6TB drives in raid 5 would give you 15TB or so formatted capacity with redundancy. That would easily be under 2k… Jeez grab 2 and sync contents to a second location… even a third location… Still under 6k.
You can get PCI cards to add more sata ports, they don’t all need to be on the motherboard
damn, annas is fucking based. hope they stay safe…
I fucking love annas archive.
Anna does seem rather insane thoughEdit: I was thinking of Alexandra Elbakyan, the creator of Sci-Hub
C’mon you gotta say why im curious
iirc her writings gave me the impression that she basically worships communism and famous soviet communist leaders in a vaguely religious way. I can go look again later and elaborate. It could just be the language barrier that gave me that impression.
Edit: I was misremembering Alexandra Elbakyan, the creator of Sci-Hub, as Anna of Anna’s Archive. Oops
Idk if I’d expect much less from a legendary pirate
Down with the bourgeoisie
Eat the rich
Sodomize the land-owners
Impale all people who have more than 25 reál in their pocket
Literally murder all human beings regardless of their political beliefs
deleted by creator
Huh, weird. Where can I find the writings?
I went to look for a link for you and found that I was misremembering. I was thinking of Alexandra Elbakyan, the creator of Sci-Hub, one of the libraries that Anna’s Archive indexes. In case you’re curious about her, here’s an archive of her personal page on Sci-Hub
Oh wow, thanks for going and checking for me. Good luck out there friend.
Anyone knows if spotify metadata have BPM and keys?
Yes, and it hasn’t been easy to dig up until recently. There were a few ways to search the “hidden” metadata fields that Spotify uses internally. But it definitely hasn’t been easy or straightforward.
Those hidden fields are how Spotify recommends similar artists. You have a few bands on repeat with specific instruments, chord progressions, and singer vocal range? Gee, maybe you’ll enjoy other bands that are similar to that…
Mashup artist detectedWould love lmao. Just bought a second hand VDJ and I’m starting to experiment with mixxx, and I don’t know is the style I like (latincore and adjacents) or if the BPM detected of mixxx isn’t that good.
Good on you for starting that up! I wish you much success in your mixing and/or producing journey!
Both. Per the SQL schema printed in the article, table
track_audio_featureshas both fields tempo and key along with many other technicals. Worth checking out, it’s near the bottom of the page.BPM yes, keys I’m not so sure.













