Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ@lemmy.dbzer0.com · 3 years ago

Help back up the Great 78 Collections before the Record Companies force The Internet Archive to take them down!

yiffit.net

359

Arghblarg@lemmy.ca to

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ@lemmy.dbzer0.com · 3 years ago

Music labels sue Internet Archive over digitized record collection - Yiffit.net

Archived version: https://archive.ph/ZGo6X

Universal Music Group (UMG.AS), Sony Music Entertainment (6758.T) and other record labels on Friday sued the nonprofit Internet Archive for copyright infringement over its streaming collection of digitized music from vintage records. The labels’ lawsuit filed in a federal court in Manhattan said the Archive’s “Great 78 Project” functions as an “illegal record store” for songs by musicians including Frank Sinatra, Ella Fitzgerald, Miles Davis and Billie Holiday. They named 2,749 sound-recording copyrights that the Archive allegedly infringed. The labels said their damages in the case could be as high as $412 million.

Representatives for the Internet Archive did not immediately respond to a request for comment on the complaint. The San Francisco-based Internet Archive digitally archives websites, books, audio recordings and other materials. It compares itself to a library and says its mission is to “provide universal access to all knowledge.”

The Internet Archive is already facing another federal lawsuit in Manhattan from leading book publishers who said its digital-book lending program launched in the pandemic violates their copyrights. A judge ruled for the publishers in March, in a decision that the Archive plans to appeal.

The Great 78 Project encourages donations of 78-rpm records – the dominant record format from the early 1900s until the 1950s – for the group to digitize to “ensure the survival of these cultural materials for future generations to study and enjoy.” Its website says the collection includes more than 400,000 recordings.

The labels’ lawsuit said the project includes thousands of their copyright-protected recordings, including Bing Crosby’s “White Christmas,” Chuck Berry’s “Roll Over Beethoven” and Duke Ellington’s “It Don’t Mean a Thing (If It Ain’t Got That Swing)”. The lawsuit said the recordings are all available on authorized streaming services and “face no danger of being lost, forgotten, or destroyed.”

See linked posting. I’ve commented there with a link to a CLI tool in Python that allows downloading of IA collections. I’ve submitted a patch to enable specifying start and end points so that it’s easier to resume downloading a huge collection, or to allow multiple people to split up the work.

https://archive.org/details/georgeblood

https://archive.org/details/78rpm_bowling_green

F*ck the RIAA and absurdly long copyright.

EDIT: There is more than one collection of 78s on IA, so I updated the title.

The issue with these collections is that they’re absolutely HUGE. And yes, IA offers torrents for them, but as a separate torrent for every. single. album. And the torrents have all data in them – FLAC, fixed-rate MP3, VBR MP3, PDF liner notes, etc. etc… there may be some extremely hardcore data-hoarders out there who want everything, but IMHO as these are scratchy old 78 records, FLAC is overkill to just save the audio in a listenable format. The George Blood collection, just the VBR MP3s, is looking to be about 6TB. With ALL data it might be over 40TB! I can’t afford that many hard drives :)

So, my approach at the moment is to save just the VBR MP3s (they seem to be done at up to 320kbps VBR) and the JPEG album cover. If I have a chance and any storage left afterwards, I can make a separate pass to get the album liner PDFs…

Tool used: https://github.com/jjjake/internetarchive

Patch to allow setting start and end item indices for downloads: https://github.com/jjjake/internetarchive/pull/605

Example usage to grab just the VBR MP3 and record label JPG for each (note the --start-idx and --end-idx arguments):

#ia download --start-idx=4001 --end-idx=8000 -a -i --format="VBR MP3" --format="JPEG" --search collection:georgeblood
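
For anyone who would rather script this than patch the CLI, the same kind of slice can be sketched in Python. This is only a sketch under stated assumptions: it treats the indices as 1-based and inclusive (the post talks about starting at item 1), which may not match how the patch counts. The names `download_slice`, `fetch_mp3_and_label` and the `item-N` identifiers are made up for illustration; only `internetarchive.download` with its `formats` and `ignore_existing` arguments comes from the library itself.

```python
from itertools import islice
from typing import Callable, Iterable, List


def download_slice(
    identifiers: Iterable[str],
    start_idx: int,
    end_idx: int,
    fetch: Callable[[str], None],
) -> List[str]:
    """Call fetch() on items start_idx..end_idx (1-based, inclusive).

    Assumption: this mirrors what --start-idx/--end-idx are meant to do;
    the patch itself may count differently.
    """
    if start_idx < 1 or end_idx < start_idx:
        raise ValueError("need 1 <= start_idx <= end_idx")
    done = []
    # islice is 0-based and end-exclusive, hence the start_idx - 1.
    for identifier in islice(identifiers, start_idx - 1, end_idx):
        fetch(identifier)
        done.append(identifier)
    return done


def fetch_mp3_and_label(identifier: str) -> None:
    """Grab only the VBR MP3s and JPEG for one item (needs `pip install internetarchive`)."""
    # Imported lazily so the slicing logic above runs without the package installed.
    from internetarchive import download

    download(identifier, formats=["VBR MP3", "JPEG"], ignore_existing=True)


# Offline demonstration: placeholder identifiers and a fetch that only records calls.
fake_items = [f"item-{n}" for n in range(1, 11)]
seen = []
picked = download_slice(fake_items, 4, 6, seen.append)
print(picked)  # ['item-4', 'item-5', 'item-6']
```

Against the real collection, the identifiers would presumably come from `internetarchive.search_items("collection:georgeblood")` (each result is a dict with an `identifier` key), with `fetch_mp3_and_label` passed as `fetch`. Splitting by index only works cleanly if everyone gets the search results in the same order, which is not verified here.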

I’m going to concentrate on the George Blood collection for now… I’m starting at item 1. It would be great if others started at index 50,000, 100,000, 150,000, … and others started at the end and worked backwards in similarly-sized chunks, so that it’s assured someone gets each of them.
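
To make the split concrete, here is a rough sketch that carves a collection into equal index ranges and prints one command per volunteer. The item count and chunk size below are hypothetical (the post does not give the real size of the collection), the ranges are assumed to be 1-based and inclusive, and the `--start-idx`/`--end-idx` flags only exist with the patch linked above. `chunk_ranges` and `ia_command` are illustrative helper names.

```python
from typing import Iterator, Tuple


def chunk_ranges(total_items: int, chunk_size: int) -> Iterator[Tuple[int, int]]:
    """Yield (start_idx, end_idx) pairs, 1-based and inclusive, covering 1..total_items."""
    if total_items < 1 or chunk_size < 1:
        raise ValueError("total_items and chunk_size must be positive")
    for start in range(1, total_items + 1, chunk_size):
        # The last chunk is clipped so it never runs past the end of the collection.
        yield start, min(start + chunk_size - 1, total_items)


def ia_command(start_idx: int, end_idx: int, collection: str = "georgeblood") -> str:
    """Build the command line from the post for one chunk (requires the patched tool)."""
    return (
        f"ia download --start-idx={start_idx} --end-idx={end_idx} -a -i "
        f'--format="VBR MP3" --format="JPEG" --search collection:{collection}'
    )


# Hypothetical numbers: the real item count of the collection is not given in the post.
TOTAL_ITEMS = 200_000
CHUNK_SIZE = 50_000

for start, end in chunk_ranges(TOTAL_ITEMS, CHUNK_SIZE):
    print(ia_command(start, end))
```

Each printed line can be claimed by one person; having a second group take the same list in reverse order gives the overlap suggested above.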

Haui@discuss.tchncs.de
link
fedilink
English
arrow-up
34
arrow-down
1·
3 years ago
Probably stating the obvious but „are in no threat of being deleted“ is an absolute joke.

A company holding the IP can just make it unavailable tomorrow. A big chunk of us are here because reddit somehow is allowed to delete our posts because the law is idiotic. At least European people are allowed to get their data, but the cooperative works of thousands of people are threatened due to those laws.

The concept of IP needs to be reformed.
- 𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍@midwest.social
  link
  fedilink
  English
  arrow-up
  15·
  3 years ago
  As concrete examples, try to get a copy of Disney’s 1946 movie, “Song of the South.” It’s been removed from circulation because of its whitewashed presentation of “happy slaves.” Similarly, 6 of Dr. Seuss’ books, including “And to Think That I Saw It on Mulberry Street” were withdrawn because of racial imagery (the mentioned book had a “Chinaman” drawn with a WWII stereotype style - rice hat, sloping eyes, buck teeth).
  
  There’s media you simply can’t get anymore.
  - Haui@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    8·
    3 years ago
    Our culture has been copyrighted.
    - 𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍@midwest.social
      link
      fedilink
      English
      arrow-up
      2·
      3 years ago
      In this case, the media was withdrawn for (arguably) good reasons: the representations were deemed hurtful or harmful.
      
      Good reasons or bad, they still stand as stark examples of how media can disappear at the whims of a single organization.
      - Haui@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        3·
        3 years ago
        Yes and it’s horrific
  - wizardbeard@lemmy.dbzer0.com
    link
    fedilink
    English
    arrow-up
    2·
    3 years ago
    Fun fact, there is a fan made blu-ray quality remaster of Song of the South available on IA.
    - 𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍@midwest.social
      link
      fedilink
      English
      arrow-up
      2·
      3 years ago
      Say what? Now I’m curious how they handled the slavery topic, and found actors for it.
      
      Thanks for the heads-up!
      - GnuLinuxDude@lemmy.ml
        link
        fedilink
        English
        arrow-up
        2·
        3 years ago
        Song of the South does whitewash being black in the USA, but it is set in post-civil war America, so superficially it does not need to handle the slavery topic, which can be dismissed as having been dealt with already.
- Arghblarg@lemmy.caOP
  link
  fedilink
  English
  arrow-up
  15
  arrow-down
  1·
  3 years ago
  Yeah. And whenever anyone says “Oh the music companies would never let these old recordings die, it’s their bread and butter!” I give them this story.
  
  We cannot trust our cultural heritage to any one entity.
  - Haui@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    3·
    3 years ago
    Oof. I just read this. It’s pretty brutal.
- WarmSoda@lemm.ee
  link
  fedilink
  English
  arrow-up
  4
  arrow-down
  11·
  3 years ago
  Did you think your posts on Reddit were protected by copyright laws or something?
  
  Are you seriously comparing posts on a forum to music rights?
  - Haui@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    1·
    3 years ago
    What exactly are you trying to convey? That these „works“ made by ordinary people who have only a basic understanding of copyright law should be deleted if someone feels like it? That the law is more important than justice?
    
    Also, do you really think you‘re cool by implying things phrased as a question? Won‘t you just talk like a normal person and state your opinion instead of fake-calling-out others?
    - WarmSoda@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      3·
      edit-2
      3 years ago
      Posts you make on a forum are not “works” that are copyrightable. Deleting a post is not an injustice.
      
      A sentence that ends with a question mark is asking a question. When someone asks a question, the normal response is to then provide an answer to that question.
      
      But you’re just being an asshole. You know exactly what I’m saying, and you know you’re saying ridiculous things, so your only response is not answering either of the two questions and then trying to twist it.
      - Arghblarg@lemmy.caOP
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1·
        3 years ago
        
        Posts you make on a forum are not “works” that are copyrightable.
        
        That may depend on the platform – slashdot (remember that site?) once upon a time had a footer on their pages stating “All posts belong to their authors”. There were a few big debates about that being legally enforceable. Hmm. I wonder if there ever was a legal ruling on that.
        
        I notice today their site does not have such a disclaimer. Probably disappeared long ago, due to one of their many corporate buyouts.
        
        WarmSoda@lemm.ee
        link
        fedilink
        English
        arrow-up
        1·
        3 years ago
        You make a good point. They specifically said reddit though.
      - Haui@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        1·
        3 years ago
        I‘m glad you saw your mistake. Have a good one.
        
        WarmSoda@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1·
        3 years ago
        Not going to answer anything, huh? Typical.
        
        Haui@discuss.tchncs.de
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1·
        3 years ago
        Does this happen to you often? Maybe rethink your approach in discussions.
        
        WarmSoda@lemm.ee
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        1·
        3 years ago
        Or, you could not be an idiot. Try that sometime.
maudefi@lemm.ee
link
fedilink
English
arrow-up
26
arrow-down
4·
3 years ago
Cool tool! Please consider leaving GitHub for any of the numerous FOSS options.
- Arghblarg@lemmy.caOP
  link
  fedilink
  English
  arrow-up
  9
  arrow-down
  1·
  3 years ago
  Oh, it’s not my project – I already have moved my own projects off there, yeah.
  - maudefi@lemm.ee
    link
    fedilink
    English
    arrow-up
    7·
    3 years ago
    That’s awesome! Really encouraging seeing projects and devs migrate away from closed-source and proprietary systems and features. 💪
    - Arghblarg@lemmy.caOP
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1·
      edit-2
      3 years ago
      sourcehut, self-hosted Gogs or Forgejo are some good candidates. Gitea is popular, but there’s apparently been some drama about them going commercial without proper buy-in from their contributors. (The code lineage is AFAIK Gogs → Gitea → Forgejo).
      
      All the above solutions make it super-easy to mirror a github project as well, just in case it goes away :) Doing so has saved my arse more than a few times when github takes a repo down for stupid reasons.
      
      Mandatory plug for !selfhosted@lemmy.world :)
      
      Gitlab seems too heavyweight to me. I use Gogs myself on my home server. No code review tools via PR à la github/gitlab, but I don’t need those in my web frontend.
the_lone_wolf@lemmy.ml
link
fedilink
English
arrow-up
9·
3 years ago
pirates sail your ship
Cyb3rManiak@kbin.social
link
fedilink
arrow-up
11
arrow-down
4·
3 years ago
Instructions unclear. Linked posting explains nothing. Will assume this is about 78 missing dragonballs and move on.

Jokes aside, we must preserve the 78 collection. What if in the future an alien signal will reach earth and no one can understand it because all 78s are extinct? We don’t have Starfleet to go back in time and get a 78 from San Francisco in the past to save the future!
backup_pirate_client@lemmy.dbzer0.com
link
fedilink
English
arrow-up
1·
3 years ago
Removed by mod
- Arghblarg@lemmy.caOP
  link
  fedilink
  English
  arrow-up
  2
  arrow-down
  1·
  3 years ago
  Aha! Well, coincidentally, a few weeks ago I just found out about another IA download tool for getting books that are hidden behind the borrow wall.
  
  DeGourou
  
  NOTE: DeGourou is incompatible with the tool mentioned in my post here (Python library differences), so install it in a different account if you want to use both tools often. (Maybe someone more fluent in Python can find out why installing one breaks the other?)
  
  Now DeGourou seems to only download individual books. Would be great if it could be made to iterate over entire collections as well…
blindsight@beehaw.org
link
fedilink
arrow-up
1·
3 years ago
Copyright has completely jumped the shark. There’s absolutely no balance between rights holders’ interests and the public benefit of the public domain.

30 years ought to be enough time for anyone to extract any reasonable value from an IP. If you haven’t made your profit in 30 years, then let the public benefit from it.

Or at least let preservationists (data hoarders, let’s be honest) keep our cultural history alive and accessible for future generations.
- Grimpen@lemmy.ca
  link
  fedilink
  English
  arrow-up
  1·
  3 years ago
  Or a renewal step. If it’s not worth renewing, let it into the public domain.
  
  This is why It’s A Wonderful Life became a Christmas classic. Because it was in the public domain, it was used as late night filler.
  
  The MPAA and RIAA miss the point. If It’s A Wonderful Life was still copyrighted, it wouldn’t have become a classic.
  
  It’s like the concept of Abandonware. If video games had a large copyright clearing house like the MPAA or RIAA, Abandonware wouldn’t work, but abandoned media will disappear. Heck, non-abandoned media also disappears because profits don’t reward preservation.
