<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE rfc [
  <!ENTITY nbsp    "&#160;">
  <!ENTITY zwsp   "&#8203;">
  <!ENTITY nbhy   "&#8209;">
  <!ENTITY wj     "&#8288;">
]>
<?xml-stylesheet type="text/xsl" href="rfc2629.xslt" ?>
<!-- generated by https://github.com/cabo/kramdown-rfc version 1.7.29 (Ruby 3.4.4) -->
<rfc xmlns:xi="http://www.w3.org/2001/XInclude" ipr="trust200902" docName="draft-romm-aipref-contentsignals-00" category="std" consensus="true" submissionType="IETF" tocInclude="true" sortRefs="true" symRefs="true" version="3">
  <!-- xml2rfc v2v3 conversion 3.30.2 -->
  <front>
    <title abbrev="AIPREF Content Signals">Vocabulary For Expressing Content Signals</title>
    <seriesInfo name="Internet-Draft" value="draft-romm-aipref-contentsignals-00"/>
    <author fullname="Michael Tremante">
      <organization>Cloudflare</organization>
      <address>
        <email>mst@cloudflare.com</email>
      </address>
    </author>
    <author fullname="Leah Romm">
      <organization>Cloudflare</organization>
      <address>
        <email>lromm@cloudflare.com</email>
      </address>
    </author>
    <date year="2025" month="October" day="01"/>
    <area>Web and Internet Transport</area>
    <workgroup>AI Preferences</workgroup>
    <keyword>AI preferences</keyword>
    <keyword>content signals</keyword>
    <abstract>
      <?line 57?>

<t>This Internet Draft proposes three categories that would enable parties to express preferences regarding how digital assets are used by automated processing systems. The proposal is for these categories to nest within the larger category of Automated Processing, currently envisaged in the <xref target="AIPREF-VOCAB"/>.</t>
    </abstract>
    <note removeInRFC="true">
      <name>About This Document</name>
      <t>
        The latest revision of this draft can be found at <eref target="https://l-romm.github.io/draft-romm-aipref-vocab-contentsignals/draft-romm-aipref-contentsignals.html"/>.
        Status information for this document may be found at <eref target="https://datatracker.ietf.org/doc/draft-romm-aipref-contentsignals/"/>.
      </t>
      <t>
        Discussion of this document takes place on the
        AI Preferences Working Group mailing list (<eref target="mailto:ai-control@ietf.org"/>),
        which is archived at <eref target="https://mailarchive.ietf.org/arch/browse/ai-control/"/>.
        Subscribe at <eref target="https://www.ietf.org/mailman/listinfo/ai-control/"/>.
      </t>
      <t>Source for this draft and an issue tracker can be found at
        <eref target="https://github.com/l-romm/draft-romm-aipref-vocab-contentsignals"/>.</t>
    </note>
  </front>
  <middle>
    <?line 61?>

<section anchor="introduction">
      <name>Introduction</name>
      <t>This proposal introduces and aims to define a specific set of preferences to address the need for expressing how digital assets can be used by automated systems, particularly in the context of training artificial intelligence (AI) models and generating search results. These preference categories enable clear and explicit communication of preferences regarding the use of digital assets for search indexing and AI training.</t>
    </section>
    <section anchor="conventions-and-definitions">
      <name>Conventions and Definitions</name>
      <t>The key words "<bcp14>MUST</bcp14>", "<bcp14>MUST NOT</bcp14>", "<bcp14>REQUIRED</bcp14>", "<bcp14>SHALL</bcp14>", "<bcp14>SHALL
NOT</bcp14>", "<bcp14>SHOULD</bcp14>", "<bcp14>SHOULD NOT</bcp14>", "<bcp14>RECOMMENDED</bcp14>", "<bcp14>NOT RECOMMENDED</bcp14>",
"<bcp14>MAY</bcp14>", and "<bcp14>OPTIONAL</bcp14>" in this document are to be interpreted as
described in BCP 14 <xref target="RFC2119"/> <xref target="RFC8174"/> when, and only when, they
appear in all capitals, as shown here.</t>
      <?line -18?>

<t>For the purposes of this document, in addition to the definitions in <xref section="4" sectionFormat="of" target="AIPREF-VOCAB"/>, the following terms are used:</t>
      <ul spacing="normal">
        <li>
          <t><strong>Search Results:</strong> hyperlinks and short excerpts returned from contents of the retrieved asset(s)</t>
        </li>
        <li>
          <t><strong>Retrieval-Augmented Generation (RAG):</strong> A technique where external content is retrieved at query time and supplied to a model to condition the generated output (B. Silver)</t>
        </li>
      </ul>
    </section>
    <section anchor="vocab">
      <name>Vocabulary Definition</name>
      <section anchor="search">
        <name>Search</name>
        <t>The act of using one or more assets to build a search index and provide Search Results. Search does not include providing AI-generated search summaries.
The use of assets for Search is a proper subset of Automated Processing usage.</t>
      </section>
      <section anchor="ai-input">
        <name>AI Input</name>
        <t>The act of inputting an asset or assets into one or more AI models for purposes of retrieval-augmented generation, grounding, or other real-time taking of content for generative AI search answers.
The use of assets for AI Input is a proper subset of Automated Processing usage.</t>
      </section>
      <section anchor="ai-training">
        <name>AI Training</name>
        <t>The act of training or fine-tuning AI models.
The use of assets for AI Training is a proper subset of Automated Processing usage.</t>
      </section>
    </section>
    <section anchor="usage-category-labels">
      <name>Usage Category Labels</name>
      <t>Each usage category in <xref target="vocab"/> is mapped to a short textual label. <xref target="_table-category"/> specifies this mapping.</t>
      <table anchor="_table-category">
        <name>Usage category labels</name>
        <thead>
          <tr>
            <th align="left">Category</th>
            <th align="left">Label</th>
            <th align="left">Reference</th>
          </tr>
        </thead>
        <tbody>
          <tr>
            <td align="left">Search</td>
            <td align="left">
              <tt>search</tt></td>
            <td align="left">
              <xref target="search"/></td>
          </tr>
          <tr>
            <td align="left">AI Input</td>
            <td align="left">
              <tt>ai-input</tt></td>
            <td align="left">
              <xref target="ai-input"/></td>
          </tr>
          <tr>
            <td align="left">AI Training</td>
            <td align="left">
              <tt>ai-train</tt></td>
            <td align="left">
              <xref target="ai-training"/></td>
          </tr>
        </tbody>
      </table>
    </section>
    <section anchor="security-considerations">
      <name>Security Considerations</name>
      <t>TODO Security</t>
    </section>
    <section anchor="iana-considerations">
      <name>IANA Considerations</name>
      <t>This document has no IANA actions.</t>
    </section>
    <section anchor="addendum">
      <name>Addendum</name>
      <t>The definition definition for Search (3.1) provided in this document may be replaced with the Search Category definition proposed in <xref target="AIPREF-VOCAB"/> and copied below:</t>
      <t>Using one or more assets in a search application that directs users to the location from which the assets were retrieved.
Search applications can be complex and may serve multiple purposes. Only those parts of applications that direct users to the location of an asset are included in this category of use. This includes the use of titles or excerpts from assets that are used to help users select between multiple candidate options.</t>
      <t>Preferences for the Search category apply to those parts of applications that provide search capabilities, regardless of what other preferences are stated.</t>
      <t>Parts of applications that do not direct users to the location of assets, such as summaries, are not covered by this category of use.</t>
      <t>The use of assets for Search is a proper subset of Automated Processing usage.</t>
    </section>
  </middle>
  <back>
    <references anchor="sec-combined-references">
      <name>References</name>
      <references anchor="sec-normative-references">
        <name>Normative References</name>
        <reference anchor="ASCII">
          <front>
            <title>Key words for use in RFCs to Indicate Requirement Levels</title>
            <author fullname="S. Bradner" initials="S." surname="Bradner"/>
            <date month="March" year="1997"/>
            <abstract>
              <t>In many standards track documents several words are used to signify the requirements in the specification. These words are often capitalized. This document defines these words as they should be interpreted in IETF documents. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements.</t>
            </abstract>
          </front>
          <seriesInfo name="BCP" value="14"/>
          <seriesInfo name="RFC" value="2119"/>
          <seriesInfo name="DOI" value="10.17487/RFC2119"/>
        </reference>
        <reference anchor="FIELDS">
          <front>
            <title>Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words</title>
            <author fullname="B. Leiba" initials="B." surname="Leiba"/>
            <date month="May" year="2017"/>
            <abstract>
              <t>RFC 2119 specifies common key words that may be used in protocol specifications. This document aims to reduce the ambiguity by clarifying that only UPPERCASE usage of the key words have the defined special meanings.</t>
            </abstract>
          </front>
          <seriesInfo name="BCP" value="14"/>
          <seriesInfo name="RFC" value="8174"/>
          <seriesInfo name="DOI" value="10.17487/RFC8174"/>
        </reference>
        <reference anchor="RFC2119">
          <front>
            <title>Key words for use in RFCs to Indicate Requirement Levels</title>
            <author fullname="S. Bradner" initials="S." surname="Bradner"/>
            <date month="March" year="1997"/>
            <abstract>
              <t>In many standards track documents several words are used to signify the requirements in the specification. These words are often capitalized. This document defines these words as they should be interpreted in IETF documents. This document specifies an Internet Best Current Practices for the Internet Community, and requests discussion and suggestions for improvements.</t>
            </abstract>
          </front>
          <seriesInfo name="BCP" value="14"/>
          <seriesInfo name="RFC" value="2119"/>
          <seriesInfo name="DOI" value="10.17487/RFC2119"/>
        </reference>
        <reference anchor="RFC8174">
          <front>
            <title>Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words</title>
            <author fullname="B. Leiba" initials="B." surname="Leiba"/>
            <date month="May" year="2017"/>
            <abstract>
              <t>RFC 2119 specifies common key words that may be used in protocol specifications. This document aims to reduce the ambiguity by clarifying that only UPPERCASE usage of the key words have the defined special meanings.</t>
            </abstract>
          </front>
          <seriesInfo name="BCP" value="14"/>
          <seriesInfo name="RFC" value="8174"/>
          <seriesInfo name="DOI" value="10.17487/RFC8174"/>
        </reference>
      </references>
      <references anchor="sec-informative-references">
        <name>Informative References</name>
        <reference anchor="UTF8">
          <front>
            <title>UTF-8, a transformation format of ISO 10646</title>
            <author fullname="F. Yergeau" initials="F." surname="Yergeau"/>
            <date month="November" year="2003"/>
            <abstract>
              <t>ISO/IEC 10646-1 defines a large character set called the Universal Character Set (UCS) which encompasses most of the world's writing systems. The originally proposed encodings of the UCS, however, were not compatible with many current applications and protocols, and this has led to the development of UTF-8, the object of this memo. UTF-8 has the characteristic of preserving the full US-ASCII range, providing compatibility with file systems, parsers and other software that rely on US-ASCII values but are transparent to other values. This memo obsoletes and replaces RFC 2279.</t>
            </abstract>
          </front>
          <seriesInfo name="STD" value="63"/>
          <seriesInfo name="RFC" value="3629"/>
          <seriesInfo name="DOI" value="10.17487/RFC3629"/>
        </reference>
        <reference anchor="AIPREF-VOCAB">
          <front>
            <title>A Vocabulary For Expressing AI Usage Preferences</title>
            <author fullname="Paul Keller">
              <organization>Open Future</organization>
            </author>
            <author fullname="Martin Thomson" role="editor">
              <organization>Mozilla</organization>
            </author>
            <date year="2025" month="October"/>
          </front>
          <seriesInfo name="Internet-Draft" value="draft-ietf-aipref-vocab"/>
        </reference>
      </references>
    </references>
    <?line 144?>

<section anchor="acknowledgments">
      <name>Acknowledgments</name>
      <t>TODO acknowledge.</t>
    </section>
  </back>
  <!-- ##markdown-source:
H4sIAAAAAAAAA61XbXPjthH+zl+Byl9sjynHuZv2qkmT6Pxy0dQ+uX5JJtPp
zEEkLGEMEiwASqf49F/6W/rL+iwAUqQtJ22m/mIRWOwunt19dpGmaeKkU2LE
Bj/qjM9qxc2aXWjDzj9XRlgryzk71aUTpWO3cl5yZQcJn82MWOLMeHJ9c37x
UiDjTsy1WY+YdXmS5DoreQEjueEPLjW6KFIuof8hzcJRG06mX32V2HpWSBjG
xrrCmcn53UVS1sVMmFGSQ/EowSErSlvbEXOmFglceZNwIzhc+knMGC9zNoFe
UwrH7gwvbaWNGyQrbR7nRteVd51dwwFhRJkJuPwo1tjORwlLGfaq7R6tRDdZ
9DNZirKGI4y9po6x4P3gJ9gkED+QIK0XXCqsc+nvbrT6Xgr3MNRmTrvcZAvs
Lpyr7Oj4mIRpSS7FsBE7poXjmdErK463ao7p+Fy6RT2DAuVRPn4J+JLC/Ax2
OqmArHUd00HDMGgcSv1f6toh1hcYLlyhBknCa7fQhvCGdcYeaqVCkgyuZLbg
QiFyouA4OvACuDkv5S/cITNG7FTpOn8ANsJviohqYd33Wbs1zHQx2GHgUvAF
u4GH/4NmRTd6oTsptSlwcOmTYXx7OpmM2M3F6dcnJ3/GwsXk/PLs1q+8O/nT
2ySR5UP3wP3dxTu/++aPX5N8qKf0x+np+P3Im2+qc8xer0/k3r3lc/E8Axnz
5RKrjtKniQh3jmeLlLa9nBVGCkvOBausLZ/0jA7v0uGD76XbSPq/NP7vIn7N
a8X+KpQSpt3toz6tRMkualdH2HfrueLGyZLdLXRhddkKIPuxKXLp9Gv6r/Qv
UimeJMPhMEnSNGV8Zp3hmUuSu4W0W77wF0b960pbYZlbGCFYJDTpF7hjK12r
nImSz5RgFXlFO5qJEJUufTAj5tzkFKiFXrFcoqK4Ytxa4SzqXbDaipzN1oSj
Rm7gA9azGFy7tk4UdohLi+gVTsNhJBJ8EbbvnGYlypitULUACvsobDMXphFa
M/3Axq2h69bQEctqA4+dWuNeS0kJlbOo4+mpm5mbTYSwkHmuRJLsEXpG53VG
YEdAt77GPbhHxMxl4f3MxYMsBePMViKTDzJDFjryrosd5Hiee0jJj1LAJ7q4
2Gb/DlAzXrLZLlwjmEchZBlVE64bL+lp6rN3AYkhS1JOYnBNhmsggeWcHGP7
48kBK3QuVLgUVoVBqlHABPEzom5r5ULcrOhcqhuumECZwhmvB/dSMOfgTFHU
pcx8+j5HZZtR5DeuSQLPMCCUoiuyzMVnfxtYAFk0txtS5NC70czISrjJGYVF
+m8KpGBojIw6owUx39/eDY7Cf/Zx6n/fnP/tfnJzfka/b38YX162P5IocfvD
9P7ybPtre/J0enV1/vEsHMYq6y0lg6vxz9ghrwbT67vJ9OP4chCihfzCUFEX
1JOphJAnCDiFyAAoCjW3SS5sZuQspPH70+t//+vkLVL5D5GfN5v4QdSMj9VC
lMGaLpEV4RMArxNeVRQgaOFKIX4VAY0s4pZZpF/JFogM0Dz8OyHzjxH7ZpZV
J2+/jQt04d5ig1lv0WP2cuXF4QDijqUdZlo0e+vPkO77O/65993g3ln85jtF
hZuevPvu2yRJLgIPsao2gTGpfroBOvLA5blPKooUSefbNKPtp6db4cmDvfUE
1WMbHwUktFJ65ZNemGLLnCPgzg4Pb0Ou34SyGx0esgUGMANXH0NiI1LGocAy
pIijGkKvKYlO0Nib+S46L2gXBbr0eYRy2rcH3shNWOYqHddzuhsEPsTSh+/7
N+MPB2R6DB+zRSn/WQvKI3gKYkF7QXk2k6S0XSOOQRTk7GQhgrd1BSbAFjFg
YBr6idMNjvAysg6kdO2q2rH990MM4WopzAHVdmdk2JY1e9rzrXsDiT0WYXva
C1yxCTWPtkhI1J5gNYKNGBca14jkQuVWS3RA3uMY7zl4fylzwfoBGTbfuUaK
lBoAlJmqcxHlwxyTbm8U9dq6KDhx5dA7Frmuw3FRLeDkvuWg1eEFEVvJrkYH
HWhtQ397kOGkJOCe9jBIS/rZR8AvucCdwSpBEc2DbnQPHaiLPYE86xaEaROH
t4kzbxPnyD8jyty3YZzUCK7BGYj7hHDcPyGgp8ke0t+cX3rDES+8dVbCvIpW
e+Hfjddd0xo9ZE0r6aPWtk9YpCafurqMg2rA51fca/X/Hg/jHHzazDqXfAZr
SXKOaTdIbecgzzqhEjZkrCCWj/UWuIKmgRolq0jLENKOunXaaMCxOLr4qTCq
CH31y9YH/H0JjoS59AtKohkF4kryZZR2/npf/S2/Au0x66PCTyH4n/zH01Ms
5U2rfRv2IN7k+icv3mb+piPehiGI+4i24m3UNxB/GrG9PjDh0fKXwX0fcA+j
HSBVEuIdjJvSrWn+sKCLUAg0ckzPpu2uF52MP45fivVGgAUnTgmS3PcR69Nh
nOeizOvCp9u25XR/djhk/83w5KDhr/zloFHwNU0ZRlSKZxCgIdvzcDzfhryj
Pr4j8pBu/b7m6TLTFdE8oNErtLL71yiXemhb5NQb4mjonyO5NGiflirK2KbF
Kh1FfIdbLfCq9utR4YraUtuChsntC93tII1htFKR3gkEWAHpFOB1Walt5x+y
KQ1OeAza8Cjy1NdT2PH2FWfpREO11OJjm9iGo/uQgQoasKVtxGx3IvZpaJl/
LsSu76Fomhj50r6/4MdCqCp6ZYUiF2fCrQQepu1VAUgu6dnMdNWkWefV3TzK
moxofSUQ1uGuvwFO0z5to6HiM6kkPTCP4uCv6DmEwyuSD72i+zqgK1lHHEnO
/UoYtO/DvxkNj9YROJiyw2478pG3RCoyjYkjvLV2hij5v/duen3OePboizx7
LPVKidw3VhsYhLeLJP4fd8mpxOgUAAA=

-->

</rfc>
