Sunday, 19 January 2025

Microsoft’s AI boss thinks it’s okay to use open web content freely

Microsoft's AI chief claims open web content is fair game for copying and use, sparking legal and ethical debates on copyright and AI.

Microsoft’s AI chief, Mustafa Suleyman, recently sparked controversy with his views on using content from the open web. During an interview with CNBC’s Andrew Ross Sorkin, Suleyman suggested that any content published on the open web becomes “freeware,” allowing anyone to copy and use it without restriction.

Suleyman’s perspective on open web content

When Sorkin asked whether AI companies have effectively been stealing intellectual property (IP) worldwide, Suleyman responded confidently. He said that a social contract has existed since the 1990s under which content on the open web is treated as fair use. According to him, anyone can copy, recreate, or reproduce such content freely.

His stance comes amid several lawsuits accusing Microsoft and OpenAI of using copyrighted online stories to train their generative AI models. While it’s not surprising to hear a Microsoft executive defend their practices, Suleyman’s public stance has raised eyebrows due to its boldness and potential legal inaccuracies.

It’s important to clarify that in the US, an original work is automatically protected by copyright from the moment it is created and fixed in a tangible form. There is no need to apply for protection, and publishing the work on the web does not void these rights. Waiving them is difficult enough that special web licences have been created to help manage them.

Fair use, on the other hand, is determined by the courts, not by a social contract. It is a legal defence that permits some uses of copyrighted material, evaluated case by case on what is copied, why, how much, and whether the use harms the copyright owner. Even so, many AI companies, including Microsoft, argue that training AI models on copyrighted content falls under fair use, although few have been as forthright as Suleyman in their claims.

The debate over robots.txt

Suleyman also touched upon the concept of robots.txt, a text file websites use to instruct bots on which parts of the site they are allowed to crawl. He suggested that if a website explicitly states it should not be scraped for any purpose other than indexing, this constitutes a grey area that needs legal clarification.

While robots.txt is not a legal document, it has served as a social contract since the 1990s, guiding bots on proper web scraping etiquette. However, some AI companies, including Microsoft’s partner OpenAI, reportedly disregard these instructions, further complicating the debate.
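
For readers unfamiliar with how robots.txt works in practice, the sketch below shows roughly how a well-behaved crawler might consult the file before fetching a page. It uses Python's standard urllib.robotparser module. The robots.txt contents, the bot names ExampleAIBot and ExampleSearchBot, and the example.com URLs are all hypothetical and chosen purely for illustration; they do not describe any real site or crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: blocks one (made-up) AI crawler entirely,
# and keeps every other bot out of /private/ only.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_crawl(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for bot in ("ExampleAIBot", "ExampleSearchBot"):
        for url in ("https://example.com/articles/ai",
                    "https://example.com/private/notes"):
            print(bot, url, may_crawl(parser, bot, url))
```

Nothing in this check is enforced by the website itself. A crawler that never runs it, or ignores the result, can still request the page, which is why robots.txt functions as etiquette rather than as a technical or legal barrier.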

Suleyman’s comments and the ongoing lawsuits highlight the tension between technological advancement and intellectual property rights. As the courts continue to address these issues, the legal landscape surrounding AI and copyright will likely evolve, impacting how content is used and protected in the digital age.
