Close Menu
Innovation Village | Technology, Product Reviews, Business
    Facebook X (Twitter) Instagram
    Sunday, May 18
    • About us
      • Authors
    • Contact us
    • Privacy policy
    • Terms of use
    • Advertise
    • Newsletter
    • Post a Job
    • Partners
    Facebook X (Twitter) LinkedIn YouTube WhatsApp
    Innovation Village | Technology, Product Reviews, Business
    • Home
    • Innovation
      • Products
      • Technology
      • Internet of Things
    • Business
      • Agritech
      • Fintech
      • Healthtech
      • Investments
        • Cryptocurrency
      • People
      • Startups
      • Women In Tech
    • Media
      • Entertainment
      • Gaming
    • Reviews
      • Gadgets
      • Apps
      • How To
    • Giveaways
    • Jobs
    Innovation Village | Technology, Product Reviews, Business
    You are at:Home»Artificial Intelligence»OpenAI launches webcrawler GPTBot, and instructions on how to block it
    ChatGPT

    OpenAI launches webcrawler GPTBot, and instructions on how to block it

    1
    By Tapiwa Matthew Mutisi on August 8, 2023 Artificial Intelligence, Chat, chatbot, Information Technology, Internet, Technology

    OpenAI has launched a web crawler to improve artificial intelligence models like GPT-4. Called GPTBot, the system combs through the Internet to train and enhance AI’s capabilities. Using GPTBot has the potential to improve existing AI models when it comes to aspects like accuracy and safety, according to a blog post by OpenAI.

    The post reads:

    Web pages crawled with the GPTBot user agent may potentially be used to improve future models and are filtered to remove sources that require paywall access, are known to gather personally identifiable information (PII), or have text that violates our policies.

    Websites can choose to restrict access to the web crawler, however, and prevent GPTBot from accessing their sites, either partially or by opting out entirely. OpenAI said that website operators can disallow the crawler by blocking its IP address or on a site’s Robots.txt file.

    Previously, OpenAI has landed in hot water for how it collects data and for things like copyright infringement and privacy breaches. This past June, the AI platform was sued for “stealing” personal data to train ChatGPT.

    Its opt-out functions were only recently implemented, with features like disabling chat history allowing users more control over what personal data can be accessed.

    ChatGPT 3.5 and 4 were trained on online data and text dating up to Sept. 2021. There is currently no way to remove content from that dataset.

    How to prevent GPTBot from using your website’s content

    According to OpenAI, you can disallow GPTBot by adding it to your site’s Robots.txt, which is essentially a text file that instructs web crawlers on what they can or cannot access from a website.

    The code for disallowing GPTBot from your site.
    Credit: Screenshot / OpenAI

    You can also customize what parts a web crawler can use, allowing certain pages and disallowing others.

    The code for disallowing or allowing GPTBot from your site's pagess.
    Credit: Screenshot / OpenAI

    Related

    AI artificial intelligence (AI) ChatGPT data GPT-4 Internet OpenAI Technology Webcrawler website
    Share. Facebook Twitter Pinterest LinkedIn Email
    Tapiwa Matthew Mutisi
    • Facebook
    • X (Twitter)
    • LinkedIn

    Tapiwa Matthew Mutisi has been covering blockchain technology, intelligent technologies, cryptocurrency, cybersecurity, telecommunications technology, sustainability, autonomous vehicles, and other topics for Innovation Village since 2017. In the years since, he has published over 4,000 articles — a mix of breaking news, reviews, helpful how-tos, industry analysis, and more. | Open DM on Twitter @TapiwaMutisi

    Related Posts

    If You’re a Junior Developer, This Gemini Update Could Be a Game-Changer

    Remote Work and Work From Home Are Not the Same — Here’s the Real Difference

    Microsoft Lays Off 3% of Workforce Amid Rising AI Investment Costs

    1 Comment

    1. Pingback: OpenAI acquires start-up Global Illumination to work on core products, ChatGPT - Innovation Village | Technology, Product Reviews, Business

    Leave A Reply Cancel Reply

    You must be logged in to post a comment.

    Copyright ©, 2013-2024 Innovation-Village.com. All Rights Reserved

    Type above and press Enter to search. Press Esc to cancel.