duplicates – Techdirt (original) (raw)
Stories filed under: "duplicates"
Content Moderation Case Study: Dealing With Podcast Copycats (2020)
from the podcast-yourself,-but-not-someone-else dept
Summary: Since the term was first coined in 2004, podcasts have obviously taken off, with reports saying that around 55% of Americans have listened to a podcast as of early 2021. Estimates on a total number of podcasts vary, but some sites estimate the total at 1.75 million podcasts, with about 850,000 of them described as ?active.? Still, for many years, actually hosting a podcast remained somewhat complicated.
A few services have been created to try to make it easier, and one of the biggest names was Anchor.fm, which tried to make it extremely easy to create and host a podcast — including the ability to add in an advertising-based monetization component. In early 2019, as part of its aggressive expansion into podcasts, Spotify purchased Anchor for $150 million.
However, in the summer of 2020, podcasters began calling out Anchor for allowing others to re-upload copies of someone else?s podcasts, claim them as their own, and monetize those other podcasts. Erika Nardini from Barstool Sports called this out on Twitter, after seeing a variety of Barstool podcasts show up on Anchor, despite not being uploaded there by Barstool.
The issue got a lot more attention a month later when podcaster Aaron Mahnke wrote a thread detailing how a variety of popular podcasts were being reuploaded to Anchor and monetized by whoever was uploading them.
After that thread started to go viral, Anchor promised to crackdown on copied/re-uploaded podcasts. The company claimed that it had an existing system in place to detect duplicates, but that those doing the uploading had figured out some sort of workaround, by manually uploading the podcasts, rather than automating the effort:
The copycats, Mignano says, found a workaround in Anchor?s detection system. ?This is definitely a new type of attack for Anchor,? he says. The people who uploaded these copycat shows downloaded the audio from another source, manually reuploaded it to Anchor, and filled in the metadata, essentially making it appear to be a new podcast.
This manual process, he says, makes uploading copycat shows more time-intensive and therefore less appealing and only achievable on a small scale. He says the company found ?a few dozen? examples out of the more than 650,000 shows uploaded to Anchor this year. (In contrast, people can also upload shows more automatically by pasting an RSS feed link into Anchor, but the company would seemingly detect if someone tried to upload a popular show?s feed and pass it off as their own.)
?The good news is that so many creators are using Anchor, and that growth has been far more than I think we projected, which is great, but I think the downside in this case is that, with any rapidly growing platform, that has brought on some growing pains and we need to do a better job of anticipating things like this,? he says. ?We?re working right now to ensure that our copycat detection and creator outreach continues to improve to keep pace.?
Decisions to be made by Anchor/Spotify:
- How do you detect which podcasts are copies from elsewhere, especially when the original versions may not have originated on Anchor?
- People can always work around attempts to block copycats, so what kind of review process can be put in place to prevent abuse?
- Will being too aggressive at preventing abuse potentially lead to taking down too much? For example, what if one podcast uses clips from another for the purpose of commentary?
- Should there be extra validation or other hurdles to turn on monetization?
Questions and policy implications to consider:
- What are the trade-offs in making it especially easy to host, distribute and monetize podcasts? Is it worth making it so easy when that process will likely be abused?
- Is there a middle ground that allows for the easy creation, distribution and monetization of audio content that won?t be as subject to abuse?
- Is there a risk that cracking down on copycat content itself could go too far and silence commentary and criticism?
While Anchor/Spotify continue to update their practices, in the midst of all of this, another story came out (with less attention) showing how being too aggressive in stopping copycats can also backfire. This story actually came out a month before Anchor said it would beef up its process for stopping copycats, and involved a podcaster who wanted to test out Anchor to see if he should recommend it on his own podcast. That podcaster, Jonathan Mendonsa, posted a test episode to Anchor to see how it works, only to have his entire account shut down with no explanation or notice.
Mendonsa was surprised to find out that Anchor was comparing audio he uploaded to Anchor to audio uploaded elsewhere, and felt that the decision to completely shut down his account immediately was perhaps a step too far. From the story at PodNews:
Jonathan told Podnews: ?I was testing Anchor to see if I would recommend it to my podcast course students. This ?duplicate content” caused them to not only take down the episode but to actually shut down my account entirely without no recourse or notice. What does that mean when a podcaster wants to republish an old episode? Or use a clip from another episode?”
For this show to have been pulled within two hours of posting must mean that Anchor is automatically comparing audio uploaded to their platform with all audio already available on Spotify – since this audio was identical to an episode already there.
?Anchor needs to clarify the tech they are using, and what triggers this,? Jonathan told us. ?I never considered that any podcast platform would be looking for duplicate content, so I just used the same trailer. I wouldn?t be mad if it got flagged or the episode got unpublished – but to delete the entire account??
It?s interesting to note the challenges on both sides of this issue, with some being upset that Anchor makes it too easy to distribute duplicate content, and others being upset at how quickly and aggressively it responds to duplicate content.
Originally posted to the Trust & Safety Foundation website.
Filed Under: content moderation, copycats, duplicates, podcasts
Companies: anchor.fm, spotify