[ai-control] Background reading

Mark Nottingham <mnot@mnot.net> Fri, 15 March 2024 05:18 UTC

Return-Path: <mnot@mnot.net>
X-Original-To: ai-control@ietfa.amsl.com
Delivered-To: ai-control@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 3F98AC14F6B2 for <ai-control@ietfa.amsl.com>; Thu, 14 Mar 2024 22:18:58 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.806
X-Spam-Level:
X-Spam-Status: No, score=-2.806 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_BLOCKED=0.001, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=mnot.net header.b="NMJzEaCZ"; dkim=pass (2048-bit key) header.d=messagingengine.com header.b="Y2hUr1aB"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id nVCvSrLfKC-l for <ai-control@ietfa.amsl.com>; Thu, 14 Mar 2024 22:18:53 -0700 (PDT)
Received: from fout2-smtp.messagingengine.com (fout2-smtp.messagingengine.com [103.168.172.145]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 3CAECC14F5F5 for <ai-control@ietf.org>; Thu, 14 Mar 2024 22:18:49 -0700 (PDT)
Received: from compute1.internal (compute1.nyi.internal [10.202.2.41]) by mailfout.nyi.internal (Postfix) with ESMTP id 16FC91380100 for <ai-control@ietf.org>; Fri, 15 Mar 2024 01:18:49 -0400 (EDT)
Received: from mailfrontend1 ([10.202.2.162]) by compute1.internal (MEProxy); Fri, 15 Mar 2024 01:18:49 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=mnot.net; h=cc :content-transfer-encoding:content-type:content-type:date:date :from:from:in-reply-to:message-id:mime-version:reply-to:subject :subject:to:to; s=fm2; t=1710479929; x=1710566329; bh=shMSZdMcCA XWNmanTLFjGcioqr6BIhiCxMV0iy0BWwI=; b=NMJzEaCZlfemN/MF2N7ggee5vF L+v4rHzZMT/ZKCG7m1+iAMcW7fUQRq3/y6+oQdMBC2uQyAyFmhr0qyKP2W+0V+sj YBOCf6H2L4LqGe5SAkfcIMg+61dOxbolZjoYWk8Gn9/7/Cz7w+XRDwPwfFxghKii tXjWdDEhcvl14zBLDuU9+Wzyg0Lm6o8vD2bEbxxCsDTGf818kNtz1edJXLiRUaox 8ByPbGLysIMWGH/d8zwQQH8vefgfAt0MmOavR26P8xcYACOncNvWNKOsWavti2xz 1rToVNbhIFP7iupiW5deGva3HtoaGgtR/bnffqBnAhduCW1vNBbdXCtWAqjg==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :content-type:date:date:feedback-id:feedback-id:from:from :in-reply-to:message-id:mime-version:reply-to:subject:subject:to :to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s= fm1; t=1710479929; x=1710566329; bh=shMSZdMcCAXWNmanTLFjGcioqr6B IhiCxMV0iy0BWwI=; b=Y2hUr1aB8cb7+w2s78LTwaBQaGdt7XLix5cErmJ4oQ0z BLUoUTjBT6G5zQ/65junAmM41Sh0z30q89GmpUd9tPQySpCfa2i84K1IQ9mdpB7s wXp0jPCDg57yT3rLV92iUVKTuowTfkM896bez0OPL9NW4Svuyo/p8VkaaPbqLOHS +lHwkpV/orQJk2ks6/0nLCDL0sAeempmmMb/NnJLz8m9+CNDZnEz2chbtgeyGx3R 8gyVAp90EFGi4l9TvDvFnAK3tZk140MeyD3U4bXucsUl+o7XduSk46FO6K7BK/gS C09Jj1OjRQrJM7Onr7viePl6kKn0oHWn88kE94Tidw==
X-ME-Sender: <xms:ONrzZY4A_4A6Lkg0rga3jr5HGI14Nbg0is88xyEjRxdo6LOWdZfBAA> <xme:ONrzZZ7s8PV8AkqJ8alu4r3A50Ce4LrpcVu5gCmcnysN1BViR51DMqMlfCJ2eARgg A35fgjIQzv4draAcQ>
X-ME-Received: <xmr:ONrzZXcLM0kWyvKK4-I8qQend9we7B_jELpC5goesgh8puo8X7NuPBJNE6odhRGCGStwQQpjEIlgj7mYFXeb9YSZ7Vpv6x7oWTneqXFKDDeP1xUOjW8e9b4D>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgedvledrjeekgdejlecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecuogfuuhhsphgvtghtffhomhgrihhnucdlgeelmdenuc fjughrpefhtgfgggfukfffvffosehtqhhmtdhhtddvnecuhfhrohhmpeforghrkhcupfho thhtihhnghhhrghmuceomhhnohhtsehmnhhothdrnhgvtheqnecuggftrfgrthhtvghrnh ephfeujeeiteduhefgvdehiefhteffheeuudegvdfhgfekffdvffekieffgedtgfdtnecu ffhomhgrihhnpehthhgvvhgvrhhgvgdrtghomhdpphhrihhvrggthihjohhurhhnrghlrd hnvghtpdgvfhhfrdhorhhgpdhprghlvgifihdrrhgvpdhorhhighhinhgrlhhithihrdgr ihdpohigrdgrtgdruhhkpdguvghvihgrnhhtrghrthhsuhhpphhorhhtrdgtohhmpdhmvg guihhumhdrtghomhdpshhquhgrrhgvshhprggtvgdrtghomhdpshhusghsthgrtghkrdgt ohhmpdgtlhhouhgufhhlrghrvgdrtghomhdpsghlohhgrdhgohhoghhlvgdpohhpvghnrg hirdgtohhmpdgtohhmmhhonhgtrhgrfihlrdhorhhgpdhsphgrfihnihhnghdrrghipdhg ihhthhhusgdrtghomhdprgigihhoshdrtghomhdpthgvtghhnhholhhlrghmrgdrtghord hukhdpvghurhhophgrrdgvuhdpmhhnohhtrdhnvghtnecuvehluhhsthgvrhfuihiivgep tdenucfrrghrrghmpehmrghilhhfrhhomhepmhhnohhtsehmnhhothdrnhgvth
X-ME-Proxy: <xmx:ONrzZdI3oA8mIJPTRszNXUNAmQ_JfmxCjAXWKHj7Gd59B0AnRqOW_g> <xmx:ONrzZcI0OmcCXRQC4ue-YzF5um75hPNYXFtJLBcaYD6bWhntoqET-g> <xmx:ONrzZexe7Z7_4K-Rw7vi18fnuGO8U3GJL7Ab-PemnUUOx3aJiDa1kg> <xmx:ONrzZQIEh4rOb5wzZVlKtYxvXwCwpp2zZzUB2jzn225rWBB_jd8vLQ> <xmx:OdrzZTVYGlAQRciKkHAnwLI8kQMHG9jaN-c6G1NXAZXXp8PlsXC9yQ>
Feedback-ID: ie6694242:Fastmail
Received: by mail.messagingengine.com (Postfix) with ESMTPA for <ai-control@ietf.org>; Fri, 15 Mar 2024 01:18:47 -0400 (EDT)
From: Mark Nottingham <mnot@mnot.net>
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3774.500.171.1.1\))
Message-Id: <3BA0785E-EDE1-46B3-B844-AD19FEB44A6C@mnot.net>
Date: Fri, 15 Mar 2024 16:18:45 +1100
To: ai-control@ietf.org
X-Mailer: Apple Mail (2.3774.500.171.1.1)
Archived-At: <https://mailarchive.ietf.org/arch/msg/ai-control/SrGx2pnqx19BwftIpTS9J0Anoig>
Subject: [ai-control] Background reading
X-BeenThere: ai-control@ietf.org
X-Mailman-Version: 2.1.39
Precedence: list
List-Id: AI Control <ai-control.ietf.org>
List-Unsubscribe: <https://www.ietf.org/mailman/options/ai-control>, <mailto:ai-control-request@ietf.org?subject=unsubscribe>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ai-control/>
List-Post: <mailto:ai-control@ietf.org>
List-Help: <mailto:ai-control-request@ietf.org?subject=help>
List-Subscribe: <https://www.ietf.org/mailman/listinfo/ai-control>, <mailto:ai-control-request@ietf.org?subject=subscribe>
X-List-Received-Date: Fri, 15 Mar 2024 05:18:58 -0000

I'm sure there's more, but here are some links that may be relevant / good to be aware of.

# General / overview

* Verge article <https://www.theverge.com/24067997/robots-txt-ai-text-file-web-crawlers-spiders>
* Privacy Journal <https://www.privacyjournal.net/block-llm-crawlers/>
* EFF article <https://www.eff.org/deeplinks/2023/12/no-robotstxt-how-ask-chatgpt-and-google-bard-not-use-your-website-training>
* AI blocking survey <https://palewi.re/docs/news-homepages/openai-gptbot-robotstxt.html>
* Another survey <https://originality.ai/ai-bot-blocking>
* And another <https://reutersinstitute.politics.ox.ac.uk/how-many-news-websites-block-ai-crawlers>

# Steps taken to block

* DeviantArt "noai" meta tags <https://www.deviantartsupport.com/en/article/how-do-i-change-the-noai-setting-on-my-deviations>
* Medium's stance <https://blog.medium.com/default-no-to-ai-training-on-your-stories-abb5b4589c8>
* Squarespace's controls <https://support.squarespace.com/hc/en-us/articles/360022347072-Excluding-your-site-from-AI-scans>
* Substack's controls <https://support.substack.com/hc/en-us/articles/20382615953556-How-can-I-block-AI-from-using-my-Substack-publication-to-train-their-models>
* Cloudflare announcement <https://blog.cloudflare.com/ai-bots/>

# AI vendor docs / efforts

* Google blog <https://blog.google/technology/ai/ai-web-publisher-controls-sign-up/>
* OpenAI robots.txt instructions <https://platform.openai.com/docs/gptbot>
* Common Crawl (used by some AI but also for other purposes) <https://commoncrawl.org/ccbot>
* Nothing from Anthropic

# Other

* ai.txt proposal <https://spawning.ai/ai-txt>
* Another ai.txt proposal <https://github.com/menro/ai.txt>
* Model collapse is another potential reason to label / control AI usage <https://www.axios.com/2023/08/28/ai-content-flood-model-collapse>
* There seems to be a hook for legal support of opt-out mechanisms in the EU's DSM Directive / AI Act <https://www.technollama.co.uk/the-eu-ai-act-and-copyright>
* Specifically, see Art IV (3) <https://eur-lex.europa.eu/legal-content/EN/TXT/HTML/?uri=CELEX:32019L0790#d1e986-92-1>

--
Mark Nottingham   https://www.mnot.net/