Business

Is OpenAI using your content without permission?

The number of organizations accusing OpenAI of stealing their work continues to grow like extra patties on a burger, with a prominent news organization now joining the fray with its own set of claims against the Microsoft-backed artificial intelligence startup.

In a lawsuit filed against OpenAI, the Center for Investigative Reporting, the oldest nonprofit newsroom in the US, claims the ChatGPT maker used its investigative journalism to train and enhance its generative AI product without permission or compensation.

It’s a tale as old as time.

Ever since ChatGPT hit the scene, different quarters of the internet have been raising alarm bells over the data used to train generative AI, often, without permission. You’ve got artistsmusic labelsauthors, heck, even programmers, who have either sued or complained against the company for allegedly using their work to build ChatGPT and its derivatives.

“This free rider behavior is not only unfair, it is a violation of copyright,” Monika Bauerlein, CEO of the Center for Investigative Reporting, said in a statement.

Free rider behavior is perhaps the best way to describe what companies developing AI are doing.

Take Meta, for example. The social media giant admitted to using users’ Facebook and Instagram posts to develop an AI assistant. Meanwhile, ChatGPT has been found to produce verbatim paragraphs from novels, complete verbatim copies of poems, and even articles from The New York Times!

In fact, CopyLeaks estimates that nearly 60% of the responses provided by GPT-3.5 (which is the model behind ChatGPT) contain some form of plagiarized content, the Center for Investigative Reporting says.

Grim, isn’t it?

At this point, the entire output of humanity, creative or otherwise, is apparently a valid target for AI companies. The question then is, are gen AI companies just profiteering off of our work? Evidence seems to suggest so.

Reddit, for example, has already struck a deal with both OpenAI and Google to let them use content from its platform to make their AI products better. There’s an age old adage: the rich get richer, while the poor get poorer. That seems to fit with Reddit’s partnership with OpenAI and Google, as the company will earn millions of dollars off of the deals but will likely never share its earnings with the users whose posts are gobbled up by OpenAI and Google to fine tune their AI models.

OpenAI also has similar arrangements with the Associated PressAxel Springer, and TIME magazine to use up journalists’ work to (probably) make ChatGPT even better. Other tech companies probably have something lined up with major publications as well.

This means that people who create will be left to do the heavy lifting while some tech bro is going to feed all that raw material to produce more powerful generative AI products, likely without permission or compensation.

The Center for Investigative Reporting is one of a handful of organizations that have taken OpenAI to court, joining the likes of The New York Times and others like it for allegedly infringing on its copyrights.

Suing OpenAI is not cheap, though. As The Verge reports, The NYT has raked up $1 million in legal costs during Q1 after it began its legal action, and there’s no telling how long this entire saga will play out — assuming both parties don’t end up settling out of court.

However, the case(s) are perhaps significant in that they could determine how AI operates within the bounds of copyright. Until then, I guess OpenAI is going to be sailing the high seas. 🏴‍☠️ 🏴‍☠️ 🏴‍☠️ ☠️☠️☠️ #IYKYK 😉

OpenAI backer Microsoft topped HackerNoon’s Tech Company Rankings this week.


In Other News.. 📰

  • Crypto Industry Is About to Boom, Is Outperforming the Internet: Architect Partners — via CoinDesk
  • Figma disables its AI design feature that appeared to be ripping off Apple’s Weather app — via TechCrunch
  • Meta accused of breaking European law with its ‘pay or consent’ model — via CNN
  • OnlyFans vows it’s a safe space. Predators are exploiting kids there. — via Reuters
  • Meta’s Threads turns one, has more than 175 million active users — via Axios
  • China’s BYD is set to take Tesla’s crown as the world’s No. 1 producer of battery electric vehicles — via CNBC

And that’s a wrap! Don’t forget to share this newsletter with your family and friends

See y’all next week. PEACE! ☮️


This article was originally published by Sheharyar Khan on HackerNoon.

HackerNoon

Recent Posts

Reality intelligence startup Track3D raises $10M to tackle construction delays

Construction is one of the world’s most complex industries to manage. Projects run late, costs…

22 hours ago

UK to force digital ID, Blair Institute claims 62% of Brits favor digital identity

Illegal immigration is the Trojan Horse of choice to deliver mandatory digital ID: perspective Using…

1 day ago

97% of CIOs, CTOs concerned about unethical use of AI at companies: Report

Since the launch of OpenAI’s ChatGPT in late 2022, use of artificial intelligence (AI) has…

2 days ago

We can’t eat it, but AI will feed the world

Since its massification in the early 2020s, AI has been slowly integrated into sectors as…

7 days ago

To monitor disinformation Von der Leyen urges European Democracy Shield, Center for Democratic Resilience

The EU, UN, WEF, and G20 all call on stakeholders to mitigate the harmful effects…

1 week ago

Trump Takes Aim at Remote Work—Is He the Movement’s Top Adversary?

Back in 2018, I wrote a story, To Kill an Outsourcing Bird. For my younger readers,…

1 week ago