[ai-control] Fwd: [arch-d] Call for Papers: IAB Workshop on AI-CONTROL

Martin Thomson <mt@lowentropy.net> Thu, 11 July 2024 01:16 UTC

Return-Path: <mt@lowentropy.net>
X-Original-To: ai-control@ietfa.amsl.com
Delivered-To: ai-control@ietfa.amsl.com
Received: from localhost (localhost [127.0.0.1]) by ietfa.amsl.com (Postfix) with ESMTP id 322A4C14F5F4 for <ai-control@ietfa.amsl.com>; Wed, 10 Jul 2024 18:16:29 -0700 (PDT)
X-Virus-Scanned: amavisd-new at amsl.com
X-Spam-Flag: NO
X-Spam-Score: -2.809
X-Spam-Level:
X-Spam-Status: No, score=-2.809 tagged_above=-999 required=5 tests=[BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H2=-0.001, RCVD_IN_ZEN_BLOCKED_OPENDNS=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, URIBL_DBL_BLOCKED_OPENDNS=0.001, URIBL_ZEN_BLOCKED_OPENDNS=0.001] autolearn=ham autolearn_force=no
Authentication-Results: ietfa.amsl.com (amavisd-new); dkim=pass (2048-bit key) header.d=lowentropy.net header.b="PtSENKrX"; dkim=pass (2048-bit key) header.d=messagingengine.com header.b="GQ5ff6Kw"
Received: from mail.ietf.org ([50.223.129.194]) by localhost (ietfa.amsl.com [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id AgriGivVJmGx for <ai-control@ietfa.amsl.com>; Wed, 10 Jul 2024 18:16:24 -0700 (PDT)
Received: from fout6-smtp.messagingengine.com (fout6-smtp.messagingengine.com [103.168.172.149]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by ietfa.amsl.com (Postfix) with ESMTPS id 7BCBFC14F5EE for <ai-control@ietf.org>; Wed, 10 Jul 2024 18:16:24 -0700 (PDT)
Received: from compute6.internal (compute6.nyi.internal [10.202.2.47]) by mailfout.nyi.internal (Postfix) with ESMTP id 7AA371380E6F for <ai-control@ietf.org>; Wed, 10 Jul 2024 21:16:23 -0400 (EDT)
Received: from imap41 ([10.202.2.91]) by compute6.internal (MEProxy); Wed, 10 Jul 2024 21:16:23 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lowentropy.net; h=cc:content-transfer-encoding:content-type:content-type:date :date:from:from:in-reply-to:message-id:mime-version:reply-to :subject:subject:to:to; s=fm3; t=1720660583; x=1720746983; bh=2F KC0cVvdcUcWSCuB++DenhJr9ugbUJD3jYUi+5WMB8=; b=PtSENKrXJgiyB2hCWR d8m7NS9o1ga3ZhdrqTNmgvafzJTXA1Y5PK/kAI+OFPRspW/O4xZKAnkh/YExih3P ZTE8SsmKfB4d2VSIK1+C9tWJisEwDUnQubz/fRA3p6IVP6XIuZYaBzt4tDlZT+JK qe+sTDz6319xKnOQjC8nEiCXyPn5i2Yc9u6Yc2A1DAz/D856Jxhkkv+pMFp5pAMc TqK2ooYu/yQ3icL5QISSrgswNnB6yYGDUuI3WjW1IxGZrFxklzPb3g0lE5dDfKWz HLEqvRbwJLZyn3I2cY+utyIxVzsFKW/0PLlMo4U9sn6KMThorh8Pmt+39t42DlcQ BkLQ==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d= messagingengine.com; h=cc:content-transfer-encoding:content-type :content-type:date:date:feedback-id:feedback-id:from:from :in-reply-to:message-id:mime-version:reply-to:subject:subject:to :to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s= fm2; t=1720660583; x=1720746983; bh=2FKC0cVvdcUcWSCuB++DenhJr9ug bUJD3jYUi+5WMB8=; b=GQ5ff6KwY7ouRFe+x5cW0XZIpX9o1RNNPAF3uwGWRC7S snIxGXqY5eOE7N+3iUraRSV48TJwmkL1IDvHQVR1oESBiEXluDufILqtoRL18MeO z5DEXPSoDMbhhBfd9kAx00yVR+IGcGXyHW+q0BiowI7T840/+uIr8gG4cyt3M2h3 SO9IPR/T6jhK6k+DoPKneRfnJnBPYU99n1nqINaNiblw1+cSc5D+9mU+YTw6WDTl uxmkVXZhmpcJFW5LXFCQxswjYi1pUA9wq6ZEJGNZ7wxMkCFLg7rq6dZPBnPwlmeI M+Ct0oE0SdlKmBG3MhzhZukUeMBoO+WY1ae6bcjRqw==
X-ME-Sender: <xms:ZjKPZp-N8FS9Lc1wOQHmxlUgNY0ktZ_bD7tghQmuC0EHb5r0l8R0NA> <xme:ZjKPZttCMwJCNx1WgyWIJ0C1FsNe9r_4Y44P5ifEKW2Udb6WhTTVFZBUTW-8z5yN5 -W9Ui6oqGXe0txw67A>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddrfeefgdeghecutefuodetggdotefrodftvf curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu uegrihhlohhuthemuceftddtnecunecujfgurhepofgfggfkfffhvffutgfgsehtqhertd erreejnecuhfhrohhmpedfofgrrhhtihhnucfvhhhomhhsohhnfdcuoehmtheslhhofigv nhhtrhhophihrdhnvghtqeenucggtffrrghtthgvrhhnpeeuueegkeffheduffelgfejue dttedvheeljefgveeltdelvefgudfhleetfefhvdenucffohhmrghinhepihgvthhfrdho rhhgnecuvehluhhsthgvrhfuihiivgeptdenucfrrghrrghmpehmrghilhhfrhhomhepmh htsehlohifvghnthhrohhphidrnhgvth
X-ME-Proxy: <xmx:ZjKPZnCRKIGNRxMsV6bRg_nQlYpMB1Ufr8riPK9nPXQpQI29xDS7Rg> <xmx:ZjKPZtf4E-mN5mQCSS6lvPLiwS2HNb5ACvvI1kN_ru92GMPUk4A82w> <xmx:ZjKPZuPhiUOGgoO4mU7xPbb6BLGws8zJA6PXVm9zchIq71dYuLwLrQ> <xmx:ZjKPZvm70TpQiQwCuSCpfYddIZGb6GLnFvbSS2hOoQ1MygaGIcLU5A> <xmx:ZzKPZoVsDJY-ahsD-i0prOxtRH9prRwFfwy8bC0A0EcynqxtO0qfH7Ta>
Feedback-ID: ic129442d:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501) id 3FF842340080; Wed, 10 Jul 2024 21:16:22 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.11.0-alpha0-568-g843fbadbe-fm-20240701.003-g843fbadb
MIME-Version: 1.0
x-forwarded-message-id: <172057035354.722600.5657735483744741788@dt-datatracker-5f88556585-j5r2h>
Message-Id: <ac40e3b2-2c2e-43a4-bb8e-a82320022ded@betaapp.fastmail.com>
Date: Thu, 11 Jul 2024 11:16:02 +1000
From: Martin Thomson <mt@lowentropy.net>
To: "ai-control@ietf.org" <ai-control@ietf.org>
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable
Message-ID-Hash: ZUXO5XTKIBIS74VWTD2SRX4NOMNZHYC3
X-Message-ID-Hash: ZUXO5XTKIBIS74VWTD2SRX4NOMNZHYC3
X-MailFrom: mt@lowentropy.net
X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header
X-Mailman-Version: 3.3.9rc4
Precedence: list
Subject: [ai-control] Fwd: [arch-d] Call for Papers: IAB Workshop on AI-CONTROL
List-Id: AI Control <ai-control.ietf.org>
Archived-At: <https://mailarchive.ietf.org/arch/msg/ai-control/eBZIwB3tfz1WoxS5pKw602gha-o>
List-Archive: <https://mailarchive.ietf.org/arch/browse/ai-control>
List-Help: <mailto:ai-control-request@ietf.org?subject=help>
List-Owner: <mailto:ai-control-owner@ietf.org>
List-Post: <mailto:ai-control@ietf.org>
List-Subscribe: <mailto:ai-control-join@ietf.org>
List-Unsubscribe: <mailto:ai-control-leave@ietf.org>

This is 100% relevant for this list.

----- Original message -----
From: IAB Executive Administrative Manager <execd@iab.org>
To: IETF Announcement List <ietf-announce@ietf.org>
Cc: architecture-discuss@ietf.org
Subject: [arch-d] Call for Papers: IAB Workshop on AI-CONTROL
Date: Wednesday, July 10, 2024 10:12

AI-CONTROL Workshop
An Internet Architecture Board Workshop

Webpage: https://datatracker.ietf.org/group/aicontrolws/about/

Large Language Models and other machine learning techniques require voluminous input data, and one common source of such data is the Internet -- usually, "crawling" Web sites for publicly available content, much in the same way that search engines crawl the Web.

This similarity has led to an emerging practice of allowing the Robots Exclusion Protocol (RFC 9309) to control the behavior of AI-oriented crawlers.

This emerging practice raises many design and operational questions. It is not yet clear whether robots.txt (the mechanism specified by RFC 9309) is well-suited to controlling AI crawlers. A content creator or host may not be able to distinguish a crawler used for search indexing from a crawler used for LLM ingest – and indeed some crawlers may be used for both purposes. Potential use cases may extend across many different units of content, policies to be signaled, and types of content creators. Before robots.txt becomes a de facto solution to AI crawling opt-out, it is necessary to examine whether it is an appropriate mechanism: in particular, whether the creator of a particular unit of content can realistically and fully exercise their right to opt-out, and the scope of data ingest to which that opt-out applies.

This workshop aims to explore practical opt-out mechanisms for AI, and build an understanding of use cases, requirements, and other considerations in this space. The workshop will focus on mechanisms to communicate the opt-out choice and their associated data models. Technical enforcement of opt-out signals is not in scope.

The IAB is looking for short position papers on the following topics; however, this list is non-exhaustive and should be interpreted broadly:

* User stories, use cases, and requirements for opting content out of inclusion in large language models, from a variety of sources including but not limited to the Web
* Interactions between opt-out mechanisms and different use cases for AI
* Advantages and/or deficiencies of reusing robots.txt for controlling AI crawlers on the Web
* Comparisons of use cases for crawling opt-out
* Desired properties of an AI opt-out mechanism
* Potential developments in AI that may require adjustments in opt-out mechanisms
* Implications of legal/policy frameworks (e.g., copyright, privacy, research ethics) and requirements on the design of opt-out mechanisms
* Evolution of opt-out signals

Because robots.txt is emerging as a solution in this space, the discussion will be anchored on it as a starting point, but not limited to that mechanism. Proposals for alternative solutions may be made, but time will not be available for a detailed presentation or discussion.

Interested participants are invited to submit position papers on the workshop topics. Participants can choose their preferred format, including Internet-Drafts, text- or word-based documents, or papers formatted similar as used by academic publication venues. Submission as PDF is preferred. Paper size is not limited, but brevity is encouraged. By default, submissions that are considered relevant will be published on the workshop website. If you wish for your submission to be anonymised or withheld from such publication, please indicate that clearly in the submission.

The organizers will issue invitations based on the submissions received. Sessions will be organized according to the submissions received, and not every accepted submission or invited attendee will have an opportunity to present; the intent is to foster an active discussion and not simply to have a sequence of presentations.

Discussion at the workshop will be held under Chatham House rule, and therefore will not be recorded or minuted. However, a workshop report will be published afterwards. It is anticipated that the workshop report will include:
A list of participants (unless they request to be withheld)
Documentation of use cases and requirements discussed
Recommendations for IETF standards work to be considered (if any)
Recommendations for non-IETF standards work to be considered (if any)

The workshop will be by invitation only. Those wishing to attend should submit a position paper to ai-control-workshop-pc@iab.org. Position papers from those not planning to attend the workshop themselves are also encouraged.

Logistics:

- Submissions Due: 2 August 2024
- Invitations Issued by: 15 August 2024
- Workshop Dates: Two-day workshop during the week of September 16. The exact dates are to be confirmed soon.
- Workshop Location: Washington, DC area. The exact location to be confirmed soon.

Feel free to contact the Program Committee with any further questions: ai-control-workshop-pc@iab.org.

_______________________________________________
Architecture-discuss mailing list -- architecture-discuss@ietf.org
To unsubscribe send an email to architecture-discuss-leave@ietf.org