OpenAI has launched a web crawler to improve artificial intelligence models like GPT-4.
Called GPTBot, the system combs through the Internet to train and enhance AI's capabilities. Using GPTBot has the potential to improve existing AI models when it comes to aspects like accuracy and safety, according to a blog post by OpenAI.
"Web pages crawled with the GPTBot user agent may potentially be used to improve future models and are filtered to remove sources that require paywall access, are known to gather personally identifiable information (PII), or have text that violates our policies," reads the post.
Websites can choose to restrict access to the web crawler, however, and prevent GPTBot from accessing their sites, either partially or by opting out entirely. OpenAI said that website operators can disallow the crawler by blocking its IP address or on a site's Robots.txt file.
Previously, OpenAI has landed in hot water for how it collects data and for things like copyright infringement and privacy breaches. This past June, the AI platform was sued for "stealing" personal data to train ChatGPT.
Its opt-out functions were only recently implemented, with features like disabling chat history allowing users more control over what personal data can be accessed.
ChatGPT 3.5 and 4 were trained on online data and text dating up to Sept. 2021. There is currently no way to remove content from that dataset.
According to OpenAI, you can disallow GPTBot by adding it to your site's Robots.txt, which is essentially a text file that instructs web crawlers on what they can or cannot access from a website.
You can also customize what parts a web crawler can use, allowing certain pages and disallowing others.
Copyright © 2023 Powered by
OpenAI launches webcrawler GPTBot, and instructions on how to block it-逆水行舟网
sitemap
文章
9
浏览
85798
获赞
88
Facebook launches 'Facebook Shops' for more in
Facebook just made it way easier to spend your money on Instagram. On Tuesday, Facebook, which ownsBBC launches voice assistant that will learn regional accents
It's a highly frustrating moment talking to a voice assistant that doesn't understand your regional'A Bug's Life' fleshlight is here to ruin your childhood memories
If you're feeling particularly nostalgic about the '90s and in the mood to tarnish your precious chiAre you online shopping a lot during quarantine? Here are some of the psychological reasons why.
Before the coronavirus, many New Yorkers took advantage of living in one of the most expensive citieMom faceswaps her kid with Thomas the Tank Engine, and it's incredibly cursed
Faceswaps are inherently pretty terrifying. Who thought this was a good idea? The proportions neverTwitter tests feature that limits who can respond to tweets
Twitter is testing a new feature that could fundamentally change the nature of the platform.On WedneChina looks to retaliate against U.S. over ‘unreasonable suppression’ of Huawei
Regardless of everything elsegoing on, the U.S. vs. Huaweisaga doesn’t seem to be slowing downKiller Mike's viral speech cuts to the heart of nationwide protests
Chaos has overtaken the streets of multiple cities in the wake of George Floyd's death, and Killer MChris Evans passionately defends Cool Ranch Doritos amidst heated chip debate
Chris Evans loves Cool Ranch Doritos, and he's not about to apologize for his good taste.After comedBBC launches voice assistant that will learn regional accents
It's a highly frustrating moment talking to a voice assistant that doesn't understand your regionalElon Musk well actually'd Grimes over their baby name just after she gave birth
Elon Musk and Grimes welcomed little X Æ A-12 Musk into the world on Monday, but Musk's TwitteTwitter's trending section is an extra hellish minefield during the pandemic
Even on its best day, Twitter's trending section wasn't exactly a stellar feature. Seemingly droppedLenovo Flex 5G laptop now available through Verizon
5G isn't just for phones. Starting this week, you can buy a real, actual laptop that connects to theLost recipes resurface on Facebook, and now we’re eating like crazy
Internet of Yumdigs into all the things that make us drool while we're checking our feeds.“HavVirtual internships and the Zoom skills you don't learn in college
With the spread of the coronavirus, summer internships — once a staple of collegiate and post-