WD-P3P-Syntax-19980519

Platform for Privacy Preferences (P3P)
Syntax Specification

W3C Working Draft 19-May-1998

This Version
http://www.w3.org/TR/1998/WD-P3P10-syntax-19980519.html
Latest Version
http://www.w3.org/TR/WD-P3P10-syntax
Previous Version:
Please see drafts at the Syntax and Encoding WG Page. [Member only]

Editors
Massimo Marchiori, W3C
Dan Jaye, Engagetech

Status of This Document

This is a W3C Working Draft for review by W3C members and other interested parties. It is a draft document and will be updated at least once (the first within 6 weeks) before being advanced towards proposed recommendation status. This document has been produced as part of the P3P Activity, and will eventually be advanced toward W3C Recommendation status.
It is inappropriate to use W3C Working Drafts as reference material or to cite them as other than "work in progress." While this is the first public draft the underlying concepts of the draft are fairly stable and we encourage the development of experimental implementations and prototypes so as to provide feedback on the specification. However, this Working Group will not allow early implementations to affect their ability to make changes to future versions of this document. For instance, XML elements and attributes are presently named for readability, though they may shortened to decrease the size of the encodings.
This draft document will be considered by W3C and its members according to W3C process. This document is made public for the purpose of receiving comments that inform the W3C membership and staff on issues likely to affect the implementation, acceptance, and adoption of P3P. Comments should be sent to p3p-comments@w3.org.
___
Copyright © 1998 W3C (MIT, INRIA, Keio), All Rights Reserved. W3C liability, trademark, document use and software licensing rules apply.

Index

Introduction
Agreement Scenarios
Protocol: Transport, Primitives and Reason Codes
P3P Schema
Base Data Set and Data Types
1. Required (Base) Data Elements and Sets
2. Data Types
Appendices
Appendix 1: References
Appendix 2: Line-flow Scenario
Appendix 3: DTDs
Appendix 4: Fingerprints and Canonicalization
Appendix 5: ABNF Notation
Appendix 6: Possible Future Features
Appendix 7: Working Group Contributors

1. Introduction

The Platform for Privacy Preferences Project (P3P) enables Web sites to express their privacy practices and users to exercise preferences over those practices. P3P compliant products will allow users to be informed of site practices (in both machine and human readable formats), to delegate decisions to their computer when appropriate, and allow users to tailor their relationship to specific sites. Site practices that are compatible with a user's preferences can, at the user's option, be accessed "seamlessly". Otherwise users will be notified of a site's practices and have the opportunity to agree to those terms or other terms and continue browsing if they wish.

P3P gives users the ability to make informed decisions regarding their Web experience and their the ability to control the use of their information. Sites can use P3P to increase the level of confidence users place in their services, as well as improve the quality of the services offered, customize content, and simplify site access.

P3P uses RDF/XML for the exchange of structured data and assertions. P3P will support future digital certificate and digital signature capabilities. P3P can be incorporated into browsers, browser plug-ins, servers, or proxy servers that sit between a client and server.

1.1 Problem Space

The P3P specification provides mechanisms:

For a user agent to be informed of a site's data collection and privacy practices.
For a user agent and service to automatically negotiate and to come to an agreement satisfactory to both parties; alternatively, for the user agent to notify the user and take instruction concerning proposed data exchanges from the user.
To exchange data when such exchange is authorized by the user and consistent with a user's preferences and any outstanding agreement.

1.2 About This Specification

This document includes all the specification necessary for the implementation of interoperable P3P applications. This specification was built upon the work of previous working groups and includes:

Protocol primitives.
Protocol schemas, including the privacy practice vocabularies of the proposal.
Protocol syntax and encoding, including the data structures and formats.
We use the following typographical conventions for representing the status of text:

Schema definition in ABNF notation.
Brief implementation note.

The ABNF notation used in this specification is specified in RFC2234 and summarized in an appendix. The DTDs provided in the Appendix are non-normative and are provided only as an exercise in ensuring the robustness of the data structures.

1.3 Operational Description and Design

A P3P proposal consists of one or more statements, each of which expresses privacy practices for a set of data elements. The P3P proposal should cover all relevant data elements and practices: if a service wants to collect a data element, it must be mentioned in the proposal. Note that most declarations in P3P are positive, meaning you state what you do, rather than what you do not do.

A core concept to the P3P specification is that of a P3P agreement. A P3P agreement is a proposal agreed to by both the service and user agent. User agents compare the privacy practices specified in a proposal with the user's preferences to determine whether to enter into an agreement. An agreement applies to all data exchanged between the user agent and service within a specified realm -- a Web resource or "tree" of Web resources referenced by a single URI, or list of Web resources referenced by a list of URIs.

Once an agreement is reached the enumerated data elements that are included in the agreed-to proposal should be interpreted by the user agent as data requests from the service. It is possible to have an agreement without referencing data elements. Such an agreement states that no information is collected.

When privacy practices do not match the user's preferences, the two sides can come to an agreement by exchanging alternative proposals. Every proposal can have a set of consequences that can be shown to a human user to explain why the suggested practice may be valuable in a particular instance even if the user would not normally allow the practice. We expect user agents will record the agreements reached, indexed by a fingerprint of the agreement, called the agreementID.

Rather than sending a new proposal to the user agent on every contact, a site may send the agreementID of an existing agreement. This is 1) asserting that the service and the user agent have already agreed to a proposal, 2) signaling which proposal and privacy practices it is operating under, and 3) requesting those data elements referenced in the agreement. The user agent may turn away, respond with the requested data, or request a full proposal -- if either it has no record of such an agreement or it desires a new one.

Sites also have the capability, as part their proposal, to request that a Temporary or Pairwise Unique ID (TUID/PUID) be automatically sent to the site when the user agent returns. A PUID is similar to a cookie but under P3P control and limited to a numerical field.

Our design is such that applications can be efficiently implemented independent of our assumptions or expectations regarding the latency of multi-round communications, the cacheability of proposals, the use of user agent or server side data repositories, and the size of the agreement repositories.

1.4 Terminology

Assuring Party: Within P3P, an assuring party is an entity which makes some statement of assurance about a proposal (e.g. that practices are audited, or are in compliance with certain data collection guidelines). Assurance may come from the service itself or from an independent third party. The assuring party must identify what it is attesting to as part of the assurance statement within the proposal, at a specified URI, or as part of the semantic definition of a meta-data schema.
Agreement: A proposal to which both the service and user agent agree. This agreement is applied within the realm and is often represented by a an agreementID. The non-repudiability of such agreements will be strengthened by the support of certificate and digital signature capabilities in future versions of P3P; however this is not specified in version 1.0. We do provide the appropriate fields for the inclusion of such tokens within P3P1.0 (e.g. a digital signature from the assuring party.)
AgreementID: A small unit of information that indicates both parties have agreed on a common proposal. An agreementID is the fingerprint of an accepted proposal. The presence of the agreementID in the P3P headers is the definitive declaration of which agreements for a given realm are in effect.
Data Element: An individual data entity, such as last name or phone number. For interoperability, P3P 1.0 specifies a base set of data elements.
Data Category: A significant attribute of a data element or data set that may be used by a trust engine to determine what type of element is under discussion, such as "Contact Information." P3P 1.0 specifies 10 base data categories.
Data Set: A known grouping of data element, such as such as "User.Home.Postal.". A set is represented with a trailing period. P3P 1.0 specifies a number of base data sets.
Fingerprint (Hash, Digest): A fixed size value resulting from the application of a hash function to a set of data. Fingerprints are easily computed, but even small modifications to the original information result in a virtually unpredictable value. Consequently, fingerprints are often used as a key to entries in a table or databases, or as methods of assuring information integrity. We use the MD5 digest algorithm for P3P fingerprints. This yields a 128-bit value that will be encoded and exchanged in P3P communications.
Preference: A rule, or set of rules, that determines what action(s) a user agent will take or allow when involved in a conversation or negotiation with a service. A preference might be expressed as a formally defined computable statement (e.g., a PICSRules rule). In this document, preferences govern the types of agreements that can be reached between a user agent and a service.
Proposal: A proposal is a collection of one or more privacy statements together with information asserting the identity, URI, assurances, and disclosures of the service covered by the proposal. A proposal is always created from the point of view of the service and contains identifying information for the service, but it may be created by the user and sent to the server for approval.
PUID: PUID := hash(UUID, Realm, agreementID). The PUID is an optional way for the user to identify herself to a service under a particular agreement over an extended period of time. PUIDs directly correspond to a specific realm and agreement. PUIDs are sent within the P3P header as part of the ReturnID with any GET request.
Realm: The realm is the experience space from which requests under a given agreement may be issued -- it broadly defines the area to which a proposal applies. It is referenced by one or more URIs. The definitive declaration of which agreements are in effect for a given URI is determined by the presence of agreementIDs. Each URI may name a specific resource or a set of resources qualified by the URI. For instance, in the HTTP URI scheme, a URI ending with an object (home.html) applies to that specific object, a URI that is a path http://www.w3.org/P3P/ references the file system tree below that path. If the proposal is not digitally signed, then each of the URIs must be from a domain that domain-matches the domain of the origin server. Domain matching is covered in the HTTP state management mechanism internet draft [STATE].
Repository: A mechanism for storing user information under the control of P3P.
ReturnID: ReturnID = (agreementID, [PUID], [TUID]), where PUID and TUID are optionally returned.
Service: A program that issues proposals and (possibly) data requests. By this definition, a service may be a server (site), a local application, a piece of locally active code, such as an ActiveX control or Java applet, or even another user agent.
Statement: A P3P statement is a set of privacy practice disclosures relevant to a collection of data elements, sets, and categories. The enumerated elements act as an embedded data request. A statement which references no data, requests no data.
TUID: TUID := hash(UUID, session salt) A temporary ID generated by the user agent that is returned to a service for the duration of one session. A new TUID should be generated for every session at every service. If a service wants a persistent ID across sessions, they should request a PUID.
URI: A Uniform Resource Identifier used to identify Web resources. For definitive information on URI syntax and semantics, see "Uniform Resource Identifiers (URI): Generic Syntax and Semantics," (which replaces RFCs 1738 and RFC 1808). This document refers to the following URIs:; proposal-URI: URI at which a proposal may be fetched.
post-URI: URI which information may be transmitted to.
disclosure-URI: the complete natural language privacy statement, referenced by the TEXT attribute in the proposal.
User Agent: A program that communicates with services regarding privacy and data exchange on behalf of the user.
UUID: A mechanism used for creating PUIDs and TUIDs. UUID is specified by [UUID]. P3P does not specify a method for collecting UUIDs.

1.5 Assumptions

P3P makes several assumptions about the environment in which it works and the problem it is trying to solve.

P3P allows a service to include identifying information about itself in its proposals. If no identifying information is provided, the service identity is assumed to be the registered owner of the service's domain.
Strong non-repudiable evidence of identities and agreements will be provided by future versions of the protocol. At this time, no clear/dominant PKI model exists for use in P3P. In the future both users and services may require signatures and certificates.
We assume that communication security is achieved through means other than P3P itself [SSL]. Hence, P3P does not provide mechanisms for cryptographically protecting information in storage or transit.
P3P agreements are end-to-end: between the user and the service. Intermediaries such as telecommunication providers, internet service providers, proxies and others may be privy to the exchange of data between a service and a user but those practices are not included in the agreement between the end parties.
While the protocol extension features [MANDA] are based on [HTTP1.1], P3P can work with [HTTP1.0] servers and proxies.
P3P covers data generated or exchanged via HTTP. While P3P proposals are separate from HTML forms, P3P proposals can accompany HTML forms and can govern the privacy of the data input into the accompanying form. [See the section on abstract elements.]
XML element attribute values may be of arbitrary length.

2 Agreement Scenarios

The following scenarios are designed to illustrate several possible ways that P3P might be used. Each scenario highlights a different P3P feature. Scenario 1 shows how a basic P3P agreement is established. Scenarios 2 and 3 show what happens when a user returns to a site where an agreement has already been reached. Scenarios 4 and 5 highlight the use of the user data repository given existing agreements.

The scenarios illustrate interactions in which no negotiation occurs. However, any scenario could be expanded to one in which a site initially offers multiple proposals, or one in which a series of proposals and/or counter-proposals are exchanged between the site and the user. The expanded example scenario in the Appendix includes an example of negotiation.

The table below summarizes the features of each scenario. A "-" in the table indicates a feature about which no assumptions (neutral) are made in a given scenario.

Scenario Existing Agreement New Proposal PUID Requested Data Requested Data Written

1 No Yes Yes - -

2 Yes No No - -

3 Yes Yes Yes - -

4 - - - Yes No

5 - - - Yes Yes

s1) No existing agreement, site sends proposal and requests PUID

User goes to home page of CoolCatalog (CoolCatalog/*), she has never been there before but is P3P-compatible.
CoolCatalog makes a proposal, including privacy practices, disclosures, and the data elements to which they apply. In this instance, the service asks to be automatically given the PUID as part of the returnID in the future.
User agrees to proposal, and agrees to send the PUID in the returnID on all future requests to the realm specified in the proposal.

Protocol Scenario: Client sends OPT header to server. If server does not support P3P, it will merely return content. (This content could be a HTML form which transfers information to a server using traditional CGI methods.) If the server supports P3P and has a relevant proposal, server shall return 409 with content and a proposal for P3P clients in a header, HTML header, or as referenced by a URI.

s2) Existing realm agreement

User has previously done scenario 1.
User returns to CoolCatalog and -- following the existing agreement -- furnishes the returnID on all page requests.

Protocol Scenario: Client sends OPT header to server, and all relevant returnIDs. Server returns relevant content or (orthogonally) can send subsequent P3P data-requests (TXD with agreementID).

s3) Existing realm agreement, new proposal

User has previously completed scenario 1.
User returns to CoolCatalog and furnishes returnID of Agreement(1) on all page requests.
User goes to order page on CoolCatalog (CoolCatalog/Order/*) and sends returnID of Agreement(1) and the service makes a new proposal because its policies at the order page differ from the rest of the site. In this instance, it wants actual order information.
The user considers the new proposal. We consider two cases, differing on whether a new PUID (as part of a returnID) is requested.

Case 1:
- The new proposal does not include a request for a new returnID, the proposal is agreed to.
- Future requests to order page will only include returnID of proposal of (CoolCatalog/*), not the order page.
Case 2:
- The new proposal does include a request for a new returnID, this proposal is agreed to.
- All future requests to order page include both the (CoolCatalog/*) and (CoolCatalog/Order/*) returnIDs.

s4) Service wants data from client repository

Any of scenarios (1,2) may apply to the client's first request for content.
In order to request data, the service has two options:
1. send agreementID
  - if the client does know it (by doing lookup against valid agreements entered into to by this client), the client transfers the requested data back to server (TXD).
  - if the client does not know it (either because it never agreed to anything with this service or it simply ran out of space in its log) the client sends a sorry (SRY) and reason code 403 (Unrecognized Agreement). Server then sends the complete proposal as follows in option 2.
2. sends the complete proposal.
  - The client inspects the proposal, matches user preferences or prompts the user for agreement, and either sends data using TXD, negotiates usage terms, or breaks off negotiation.

s5) Client write information to service

This is an added step to scenario 4, where the service's proposal includes writing one or more data elements to the client data repository.
In this case, once the user has accepted the agreement, the service will send the appropriate data elements, which are then saved transparently by the user agent.

Protocol Scenario: Scenario 5 is an extension of scenario 4 which assumes that scenario 4 ended with the user agent sending an OK and optionally a TXD to the server. Now the server will reply with its own TXD sending the appropriate data elements to the client. If successful the user agent will respond with an OK/200 return code; if unsuccessful the client will respond with SRY/500 return code.

3 Transport, Primitives and Reason Codes

The following section specifies primitives necessary for forming an agreement and transferring any subsequent data if appropriate.

3.1 Data Transport

P3P was designed to exchange the primitives and their content in the headers of a P3P HTTP extension. However, we foresee instances in which the content of the PROP or TXD primitives could exceed the expected length of a header in some HTTP implementations (e.g. a large graphical image). Hence, as an application of RDF/XML, P3P implementations should be able to conform to methods of RDF/XML exchange when those methods are completely specified. Those methods are likely to be:

Within the HTTP extension header.
At a specified URI that a client or server then obtains with a HTTP GET request.
As part of the content of a XML or HTML page; if HTML, it should not be rendered by older implementations.

P3P1.0 specifies a required method for methods 1-3.

Scenario Proposal-URI Post-URI Proposal Embedded in Content

6 Yes - No

7 No - Yes

8 - Yes -

s6) Service references proposal at URI

The service believes that an intermediary proxy cache might truncate long HTTP headers, or the service would like to its proposal to be cached at a proxy:

User sends GET to service and requests proposal.
Service sends prop-message with the proposal-URI.
<proposal AGREEMENTID="94df1293a3e519bb" proposal-URI="http://www.CoolCatalog.com/P3P.RDF" />
User sends GET to the proposal-URI.
The service at the proposal-URI sends the proposal.

The proposal-URI is not relevant to the definition of the realm. Rather, implementations which need to make trust decisions on the basis of the realm should make them with respect to the REALM defined in the proposal, and the original URI they fetched in scenario (1).

s7) Service embeds proposal in HTML content

Content providers may wish to declare simple privacy practices without the sending modified HTTP headers. In such a case, sites MAY embed a simple P3P proposal within the HTML HEAD tag. Since a P3P proposal does not contain content within tags (<tag>content</tag>) we need not worry that older versions of HTML browsers will render such data. However, such a proposal has the following constraints:

They MUST be embedded within the HEAD tag.
The proposal MUST refer to a proposal-URI.
Only a single proposal may be made for any realm. No overlapping proposals are permitted.

Consequently,

User sends GET to service and requests a proposal through the presence of the optional header.
Service returns content without header.
User agent MUST examine the HEAD tag of the HTML-content for a proposal.
Upon finding such a header, it would fetch the proposal from the proposal-URI as described in scenario 6.

Note: We hesitate to recommend this method because of the potential (undetermined) overhead of having to seek proposals in the content and the complexity of managing a site with embedded proposals. Consequently, this method should only be used for sites which 1) do not use P3P data methods and 2) have a few, simple, non-overlapping proposals for their site.

s8)Client posts information to a URI

The service requests a large binary file, or prefers to process data as it did with forms/CGI and wishes to have that information presented using POST.

User sends <get> to service and requests proposal.
Service sends a proposal that includes a post-URI.
User posts RDF/XML encoded information at the post-URI.

The post-URI is an implicit part of the REALM. Consequently, information may be posted to the provided post-URI under an agreementID even if the URI is not within the REALM-URI.

3.2 P3P Requests

The user agent communicates to the server using standard HTTP methods such as "GET" or "POST". P3P uses the mandatory extension mechanism [MANDA] for transporting data between the agent and service. When placing an initial request to the server, the user-agent MAY include the p3p-opt-header to notify the server that the user agent is P3P-compliant. If the user-agent sends the OPT header, and the server implements the P3P protocol, then it MUST:

Send the OPT header that indicates that it understands P3P.
If it has a proposal that covers the page in question, it MUST send it.

3.2.1 Syntax of P3P Requests

[1]	`p3p-request`	`=`	`start-line` `*message-header` `"OPT" ":" p3p-opt-header` `[p3p-header-prefix "P3P1.0" ":" p3p-header CRLF]` `[message-body]` `; start-line, message-header, CRLF, and` `; message-body are as defined in the` `; HTTP 1.1 specification [HTTP1.1].`
[2]	p3p-opt-header	=	p3p-extension ";" "ns-" p3p-header-prefix
[3]	p3p-extension	=	`"` `http://www.w3.org/TR/1998/WD-P3P10-syntax#proposal.DTD`/ `"`
[4]	p3p-header-prefix	=	1*digit "-"
[5]	p3p-header	=	"<p3p" p3p-qualifier* ">" 1*p3p-content "</p3p">"
[6]	p3p-qualifier	=	( ("FINAL=" \| "FIN=") yesno ) \| (<"> token <"> "=" token)
[7]	`yesno`	`=`	(`"` "1" `"` \| `"` "Yes" `"` ) \| (`"` "0" `"` \| `"` "No" `"` )

3.3 Negotiation Primitives

The following table summarizes the meaning of the P3P primitives as well as their relations, content, and HTTP return codes. All primitives may be sent by a service or agent.

Message Meaning After Receiving Expected Response Data in Message Optional in Message HTTP Return Code

OK Proposal acceptable or data transfer successful PROP or TXD none MD5 hash of agreement or data transferred 2xx

PROP Here's a Proposal Any time OK, SRY, or PROP Text of a proposal Signature of initiator, fingerprint of previous Proposal 409

SRY Sorry - request not processed PROP, TXD PROP or none Reason code and MD5 hash of proposal or data transferred Which practices are unacceptable 409

TXD Transfer Data Any time none, OK or SRY Data element names and values to be written, as requested Agreement 2xx

This section describes each of the operations and specifies who (user or service) can initiate the action and under what circumstances.

3.3.1 Success (`OK`)

This message is sent when an entity agrees to a proposal, or when a data transfer has succeeded. If the OK is sent in response to a proposal, the OK message includes the agreementID. If the OK is sent in response to a TXD message, the OK message includes the MD5 hash of the data that has been transferred.

3.3.2 Here's A Proposal (`PROP`)

At any time, either participant can send one or more proposals to the other, these are sent in the prop-msg. The proposal's terms aren't binding until the other side has agreed to them (by responding with the OK or TXD primitives). The proposal may be signed by the party that creates it if so desired. In addition the PROP MUST nclude the agreementID of a proposal previously received from the other party (this may help the other party keep track of the negotiation). If a site wishes to express that a data element is optional, it may do so within the proposal; the user agent will return the optional elements it feels appropriate. An agreement over a proposal with optional purposes or qualifiers is ambiguous; there is no clear way for the agent to express that one purpose was agreed to, but another was not. Consequently, optional purposes or qualifiers MUST be expressed through multiple unambiguous proposals.

If a proposal isn't automatically acceptable to the user agent, there are three options. The user agent must be programmed in some manner to decide which response is appropriate (including prompting a user for guidance):

Refuse (SRY) the proposal. This implies a desire to receive a new proposal from the service. Notice that there is a potential for an infinite loop if the user agent continues to send a SRY message should the same proposal be sent repeatedly. The user agent is responsible for maintaining sufficient state to detect this case and respond using one of the other techniques after it determines that a sufficient number of negotiation rounds have occurred; the service might, after all, choose to change its proposal if the agent appears sufficiently obstinate.
Return a Proposal of its own. This implies a refusal, but provides an alternate that the user considers acceptable if accepted by the service. The service has the right to assume that by making the offer the user is willing to accept it if the service does.
Note: the creation of a counter-proposal by the user agent is not required for P3P1.0 compliance.
Walk away from the negotiation. There is no requirement that the user agent respond at all. The user agent may simply not send another request to the service.

If the proposal is acceptable to the user-agent, it should transmit the data to the service along with its OK. By transmitting the data under an agreementID, the user agent indicates that it has accepted the proposal. The TXD includes the agreementID of the accepted proposal, and the service can determine which optional data elements the user has sent by inspecting the data objects transmitted.

3.3.3 Sorry (`SRY`)

This message is sent in response to a proposal (PROP) or a data transfer (TXD). It indicates that the request could not be processed. The refusal includes two pieces of information: a reason code and agreementID. The reason code indicates what type of request is being refused and why it is being refused. Since multiple proposals may be sent, the agreementID is necessary to identify which request is being refused. The particular causes for refusing a proposal are given by the reason codes.

Finally, the SRY message can be used in response to a Transmit Data (TXD) message when the data transfer fails. The sender SHOULD NOT retry the TXD message.

Note: When a proposal is refused, P3P1.0 does not include a mechanism to specify exactly what portions of the proposal are unacceptable. A mechanism to indicate what part of a proposal is unacceptable may be included in a future version of the P3P protocol. In P3P1.0, a possible work-around for services or user agents which wish to do detailed negotiation is to send a counter-proposal, modifying or deleting those portions of the original proposal which were unacceptable. Implementers should be wary of the privacy implications of releasing user privacy preferences in either case.

3.3.4 Transmit Data (`TXD`)

After the receipt of a proposal, the user agent may send out the requested data. The user agent MUST include the agreementID(s) it believes it is operating under. The service MUST honor the agreements if they are valid. If the agreementID refers to an invalid or nonexistent agreement, the service MUST respond to the TXD with a SRY message, setting the response code to 401 (Unrecognized Agreement).

If the proposal requests optional elements, the TXD may not include all the requested data elements. It is up to the user agent to decide how many of these to transfer; the agreement MAY inform the user of the consequences (presumably rewards) of sending the optional items.

3.3.5 Ending negotiation

There is no primitive to end negotiation. Instead, this capability is provided through a qualifier on a P3P message. Any PROP or SRY message can include the FINAL qualifier. When a service receives a PROP with a FINAL qualifier, it MUST respond with either an OK or a SRY. The response MUST include a content-body, though it may be as simple as an HTML document which states that the requested object cannot be returned without an agreement. When a user-agent receives a PROP with the final qualifier, it should understand that the service has presented its "final offer", and that further negotiation is likely to be pointless.

3.3.6 Syntax of Negotiation Primitives

[8]	p3p-content	=	OK-msg \| PROP-msg \| SRY-msg \| TXD-msg \| extension-msg
[9]	OK-msg	=	"<ok" message-attribute "/>"
[10]	PROP-msg	=	(short-proposal \| proposal)
[11]	SRY-msg	=	"<sry" message-attribute "/>"
[12]	TXD-msg	=	"<txd" message-attribute ">" data-xfer "</txd>"
[13]	extension-msg	=	"<" token message-attribute ">" *token "</" token ">"
[14]	message-attribute	=	agreement-id \| reason-attribute \| data-hash-attribute
[15]	reason-attribute	=	"r" "=" `"` reason-code `"`
[16]	data-xfer	=	<XML formatted data element name-value pairs>
[17]	agreement-id	=	<base64 of 128 bit MD5 digest of proposal as per RFC 1864>
[18]	data-hash-attribute	=	"dh" "=" <base64 of MD5 digest of data-xfer> ; used to acknowledge the receipt of data

3.4 Reason Code Definitions

Reason codes are optionally sent within P3P headers: every SRY message carries an implied 400 reason code, and every other message carries an implicit 200 reason code; including a reason code in the message overrides these defaults. Reason codes can be divided into three main classes: success codes, rejection codes and error codes. A detailed explanation of these three classes follows.

3.4.1 Success codes

This class of reason code indicates that the request was successfully received, understood, and accepted.

200 OK

This reason code can be sent in an OK or TXD message in response to a PROP or TXD message. It is the default value reason code for OK, PROP and TXD messages.

3.4.2 Rejection codes

This class of reason codes indicates that the request was successfully received and understood, but not acceptable. These codes MUST be sent only in a SRY message.

400 Other Rejection

This reason code MAY be used in a SRY message when none of the other rejection codes defined below are appropriate. It is the default value reason code for SRY messages. User agents and services are encouraged to use one of the other rejection codes whenever possible.

430 Unrecognized Agreement

This reason code MAY be sent on a SRY message in response to a PROP or a TXD message. This reason code MUST be sent by the recipient of a TXD when the agreementID does not reference a known and valid agreement.

431 Agreement Expired

This reason code MAY be sent on a SRY message, in response to a PROP message. It indicates that the agreement referenced by the agreementID has expired.

432 Proposal Rejected

This reason code MAY be sent on a SRY message, in response to a PROP message. It indicates that the proposal is unacceptable to the entity sending the SRY message.

433 Data Unavailable

This reason code MAY be sent in a SRY message, in response to a PROP message. It indicates that the user-agent does not have the data that is being requested.

434 Data Not Accepted

This reason code MAY be sent in a SRY message, in response to a TXD message. This can indicate that the receiver is unable to accept the offered information.

Error codes

This class of reason codes indicates that the request was not received successfully, or that the request was received but could not be understood. These codes MUST be sent only in a SRY message.

500 Other Error

This reason code MAY be used in a SRY message when none of the other error codes defined below are appropriate. User agents and services are encouraged to use one of the other error codes whenever possible.

530 Invalid Format

This reason code can be sent in a SRY message in response to any message. It indicates that the P3P message that was received was syntactically invalid or semantic garbage. For example, if a proposal includes incorrect syntax, or ambiguities or contradictions in its statements, the agent should reject such a proposal under the 530 reason code.

531 Data Transfer Unsuccessful

This reason code can be sent in a SRY message in response to a TXD message. It indicates that a data transfer request was received, and it was syntactically valid, and covered by a valid agreement, but the data could not be stored for some reason.

[19]
reason-codes

=

200 | 400 | 430 | 431 | 432 | 433 | 434 | 500 | 530 | 531

4. P3P Schema

This section describes the schema for 1) P3P proposals and 2) data transfer. A more complete English language description of the terms used in a proposal may be found within [HARMV].

4.1 Proposal

We would like to encode the following statement; it is one proposal, with two statements within which are logical ANDs. "Other disclosures" automatically apply to the whole proposal.

CoolCatalog, a member of PrivacyAssured, makes the following statement for the Web pages at http://www.CoolCatalog.com/catalogue/.
We collect your name, age, and gender to customize our entry catalog pages for the type of clothing you are likely to be interested in and for our own research and product development. We do not use this information in identifiable form. We also collect your shipping information so that we can send you your order; this is of course is identifiable. We do not redistribute any of this information outside of our organization. We do not provide access capabilities to information we may have from you, but we do have retention and opt-out policies, which you can read more about at our privacy page http://www.CoolCatalog.com/PrivacyPractice.html..

The following is a more formal description, where comments are preceded by a semicolon:

in proposal schema
CoolCatalog makes ; entity
this statement for http://www.CoolCatalog.com/catalogue/ ; realm, where the * is assumed
with assurance from PrivacyAssured

we read ; read is default
   (User.Name.First) ; required is default
   (User.Bdate.Year OPTIONAL)
   (User.Gender)
    for the purpose of (2 3) ; Customization of the Site to Individuals, R&D
    in (0) ; Non-Identifiable Form and the recipients are (0) ; only ourselves and agents
    with consequence of ("a site with clothes you'd appreciate.") ; optional
we read&write
    (User.Shipping.) ; this is a whole data set.
    for the purpose of (0) ; Completing the Current Transaction
    in (1) ; Identifiable Form and the recipients are (0) ; only ourselves and agents

access disclosure (3) ; no access to identifiable data
other disclosures (0 1) ; we make disclosures regarding changing agreement and retention
at http://www.CoolCatalog.com/PrivacyPractice.html

4.2 RDF Encoding of Proposal

The following RDF captures the information as expressed above. P3P proposals MUST be statements that are properly expressed according to the syntax model of RDF as well as valid and well-formed XML. As a specific application of RDF we make a number of assumptions about RDF:

the <RDF:RDF> tag may be optionally omitted.
a tag without a name space should be assumed to be of the P3P namespace.

<?xml:namespace ns=":http//www.w3.org/TR/1998/WD-P3P10-syntax#proposal.DTD" prefix="p3p"?>
<?xml:namespace ns="http://www.w3.org/TR/WD-rdf-syntax#" prefix="RDF"?>
<RDF:RDF><proposal REALM="http://www.CoolCatalog.com/catalogue/" ENTITY="CoolCatalog" 
 ASSURANCE="http://www.TrustUs.org">
  <uses>
  <statement PURPOSE="2, 3" RECIPIENT="0" ID="0"
   CONSEQUENCE="a site with clothes you'd appreciate.">
    <with><prefix NAME="User.">
     <ref NAME="Name.First"/>
     <ref NAME="Bdate.Year" OPTIONAL="1"/>
     <ref NAME="Gender"/>
    </prefix></with>
  </statement>
  </uses>
  <uses>
  <statement ACTION="read&write" PURPOSE="0" RECIPIENT="0" ID="1">
    <ref NAME="User.Shipping."/>
  </statement>
  </uses>
  <DISCLOSURE TEXT="http://www.CoolCatalog.com/PrivacyPractice.html" 
   ACCESS="3" OTHERDISCLOSURE="0 1"/>
</proposal></RDF:RDF>

4.2.1 `proposal` element.

<proposal>: includes one or more statements. Each statement includes a set of disclosures as applied to a set of data elements.
REALM=: the list of URIs to which the proposal applies.
ASSURANCE=: a service that attests that the entity will abide by its proposal, follows guidelines in the processing of data, or other relevant assertions.
ENTITY=: a text field used to describe the legal entity providing the service and entering into the agreement with the user agent.
AGREXP=: the date on which an agreement, if reached, will expire. Default is 6 months. The agreement expiration is the last date when a user agent can transfer data to the service under the agreement. The service continues to be bound by the restrictions of the agreement for data collected under the agreement, even after the expiration. Proposals expire after the time indicated by the "EXPIRES" HTTP header. The default expiration is 1 hour.

The REALM attribute defines the scope of the agreement. It is used by the user agent, for example, to determine whether to automatically transmit a ReturnID or whether to allow a data transfer request. If the proposal is not digitally signed, then each of the Realm URIs must "domain-match" [Kristol] the domain of the server. If the proposal is digitally signed, then the Realm is implicitly expanded to include the current request URI whenever a server outside the original Realm makes a digitally signed proposal referencing an existing agreement.

As noted above, the user agent uses the Realm to determine if the current request URI is subject to any existing agreements. The user agent automatically transfers the ReturnID with the request if a valid agreement requested the ReturnID. This provides a mechanism for the service to maintain state or capture visitor-level statistics.

[20]
proposal

=

&nbssp; `<?xml:namespace ns="http://www.w3.org/TR/WD-P3P-syntax#" prefix="p3p"?>` short-proposal | long-proposal | (`<?xml:namespace ns="http://www.w3.org/TR/WD-rdf-syntax#" prefix="RDF"?>` "<RDF:RDF>" (short-proposal | long proposal ) "<\RDF:RDF>")

The short-proposal is sent when a site wishes to assumes that a agent is aware of the agreement or wishes to refer the agent to a complete proposal at the proposal-URI.

[21]
short-proposal

=

"<proposal" ("AGREEMENTID=" | "AGRID=") agreementID ["proposal-URI=" URI] "/>" "</proposal>"

The longer proposal includes all of the necessary attributes.

[22]
long-proposal

=

"<proposal" ("AGREEMENTID=") quoted-string ["AGREXP=" datetime] ;default is six months "REALM=" `"` URI *(" " URI) `"` "ENTITY=" quoted-string [("ASSURANCE=") `"` URI `"`] ["postURI=" `"` URI `"`] ">" 1*statement-block disclosure "</proposal>"

[23]
schemaURI

=

<"> "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD" <">

[24]
quoted-string

=

<UTF-8 quoted string>

[25]
URI

=

<quoted URI as per RFC 2068 [URI]>

[26]
datetime

=

< date/time as per section 3.3 of RFC 2068>

4.2.2 `statement` element

<statement>: purposes and their qualifiers as applied to data elements.

PURPOSE=
purposes for data processing relevant to the Web.
RECIPIENT=
an organizational area, or domain, beyond the service provider and its agents where data may be distributed.
ID=
is data used in a way that is personally identifiable -- including linking it with identifiable information about you from other sources?
ACTION=
read the data, or read and write

[27]	statement-block	=	"<uses>" "<statement" [("ACTION=" \| "ACT=") `"` action `"`] [("PURPOSE=" \| "PUR=") `"` purpose ("," purpose) `"` ] ; defaults 0 [("ID=" `"` identifiable `"`)] ; defaults 0 [("RECIPIENT=" \| "REC=" )] `"` recipients ("," recipients) `"` ; defaults 0 [("CONSEQUENCE=" \| "CON=") quoted-string] ">" *(datablock) "<statement>" "</uses>"
[28]	purpose	=	("0" \| "Completion and Support of Current Activity") \| ("1" \| "Web Site and System Administration") \| ("2" \| "Customization of the Site to Individuals") \| ("3" \| "Research and Development") \| ("4" \| "Contacting Visitors for Marketing of Services or Products") \| ("5" \| ("Other Uses" string))
[29]	recipient	=	("0" \| "only ourselves and our agents") ("1" \| "organizations following our practices") ("2" \| "organizations following different practices") ("3" \| "unrelated third parties or public forum")
[30]	identifiable	=	("0" \| "no") \| ("1" \| "yes") ; default no
[31]	action	=	("r" \| "read") \| ("rw" \| "read&write") ; default read

4.2.3 The `CATEGORY` Attribute

Categories are attributes of data elements which provide hints to users and user agents as to the intended uses of the data. Categories are vital to making P3P user agents easier to implement and use; they allow users to express more generalized preferences and rules over the exchange of their data. Categories are often included when defining
a new element or referring to form data.

Membership of data elements in categories is generally disjoint; most data elements should belong to just one category. However data elements may belong to more than one category when necessary. In addition, a data set belongs to all of the categories to which its elements belong. Base data elements have all been assigned a default
category in the P3P specification. When a service proposes a new data element, the proposal should include the default category or categories for the element, however the user and/or user agent retains final control over the category assignment.

[32]
categories

=

`"` *(data-category ",") data-category `"`

[33]
data-category

=

("0" | "Physical Contact Information") | ("1" | "Online Contact Information") | ("2" | "Unique Identifiers") | ("3" | "Financial Account Identifiers") | ("4" | "Computer Information") | ("5" | "Navigation and Click-stream Data") | ("6" | "Transaction Data") | ("7" | "Preference Data") | ("8" | "Demographic and SocioEconomic Data") | ("9" | "Content")

4.2.4 `DISCLOSURE` element

Simple disclosures regarding service access capabilities, and binary values as to whether the entity makes disclosures regarding changing the agreement (opt-out) for data already collected and how long data is retained.

DISCLOSURE=
URI of the English language statement
ACCESS=
the ability of the individual to view identifiable information and address questions or concerns to the service provider.
OTHER=
does the site make a its policy regarding change_agreement and retention known at its disclosureURI?

[34]

disclosure

("DISCLOSURE=" | "DIS=" ) URI
"ACCESS=" `"` access-disclosure `"`
[("OTHERDISCLOSURE" | "OTHER=") `"` 
   other-disclosure *("," other-disclosure) `"`]

[35]

access-disclosure

("0" | "Identifiable Data is Not Used") |
("1" | "Identifiable Contact Information") |
("2" | "Other Identifiable Information") |
("3" | "None")

[36]

other-disclosure

[("0" | "change_agreement")] | [("1" | "retention")]

4.3 Data Transfer

4.3.1 Rerefencing data: the `<ref>` tag

References to data sets/elements are all enclosed within a statement using the <ref> tag. The <ref> tag has the following attributes: NAME, DATASCHEMA, VALUE, OPTIONAL, CATEGORY, TYPE, TYPESCHEMA, SHORT.

The NAME attribute is a string denoting the name of a data element/set. Sets and elements are syntactically distinguished by the presence of a trailing dot after the set name. For instance, to indicate the set User.Home, the string User.Home. is used.
The DATASCHEMA attribute is a URI denoting the name of the data schema the data element/set specified in name belongs to. A missing dataschema attribute, which is equivalent to an empty string, means that the dataschema is the basic one referenced by the current P3P version.
The VALUE attribute is a string denoting the actual value of the data element specified in the name attribute. In the case of a data element request, the value attribute is missing.
The OPTIONAL attribute can be "0" or "1". "0" indicates that the data element is required, while "1" indicates that the data element is not required. The default is 0. The optional field is used only in the case of a data element request.

The following four attributes are only used when a new (not defined in the P3P1.0 base) data element or set is referenced.

The TYPE attribute is a string denoting the type of a data element/set.
The TYPESCHEMA attribute is a URI denoting the data schema where the type specified in the type attribute is defined.
The SHORT attribute is a string denoting the short display name of the data element/set.
The LONG attribute is a string denoting a longer description of a data element/set.
The CATEGORY attribute is a string denoting the categories of a data element.

[37]

data-reference

"<ref" "NAME=" `"` data_element `"` [("DATASCHEMA=" | "DSCH=") `"` URI  `"`] 
[("OPTIONAL=" | "OPT=") `"` yesno `"`]
["TYPE="quoted- string] [("TYPESCHEMA=" | "TSCH=") `"` URI `"`] 
["CATEGORY=" categories] 
["SHORT=" quoted-string] 
["LONG=" quoted-string]
"/>"

As an example, to request the the user's shipping address city, all the elements of the data set User.Business.Info. and (optionally) all the elements of the data set User.Home.Info.Phone. the service would send the following references inside a P3P proposal:

<ref NAME="User.Shipping.Info.Address.City"/>
<ref NAME="User.Home.Info.Phone." optional="1"/>
<ref NAME="User.Business.Info"/>

If the user agrees to returning the city and business information and only the international phone code of her home phone number, she returns the following inside the txd tag:

<ref NAME="User.Shipping.Info.Address.City" VALUE="Boston"/>
<ref NAME="User.Home.Info.Phone.IntCode" VALUE="1"/>
<ref NAME="User.Business.Info.Address.Street" VALUE="12 Main St."/>
<ref NAME="User.Business.Info.Address.City" VALUE="Cambridge"/>
<ref NAME="User.Business.Info.Address.Name" VALUE="Smith"/>
...

4.3.2 Prefixing references using `<with><prefix>`

In a set of data references, many element names may share a common parent. For instance, "User." is a common occurrence in the references directly above. The <with><prefix> tags can be used to associate a string prefix to the names of a group of data elements or sets. Consequently, this can be used to shorten the length of the names or to reference new data sets and elements. If a prefix tag is present within a data block, then the block is equivalent to one where the prefix tag is omitted, and each NAME value is prefixed by the prefix. For instance, the above example could be rewritten as:

<with><prefix NAME="User.">
<ref NAME="Shipping.Info.Address.City" VALUE="Boston"/>
<ref NAME="Home.Info.Phone.IntCode" VALUE="1"/>
<ref NAME="Business.Info.Address.Street" VALUE="12 Main St."/>
<ref NAME="Business.Info.Address.City" VALUE="Cambridge"/>
<ref NAME="Business.Info.Address.Name" VALUE="Smith"/>
...
</prefix></with>

The <with><prefix> tags can be nested; the prefixes have to be added proceeding from the innermost <with><prefix> ... </prefix></with>combination (here, innermost means that there are no other <with><prefix> tags within. So, the above example could be rewritten even more compactly in the following way:

<with><prefix NAME="User.">
<ref NAME="Shipping.Info.Address.City" VALUE="Boston"/>
<ref NAME="Home.Info.Phone.IntCode" VALUE="1"/>
<with><prefix NAME="Business.Info.Address.">
    <ref NAME="Street" VALUE="12 Main St."/>
    <ref NAME="City" VALUE="Cambridge"/>
    <ref NAME="Name" VALUE="Smith"/>
    ...
</prefix></with>
...
</prefix></with>

Note: use of two tags (<with><prefix>) in place of one (just <prefix>) is technically due to the fact we are employing the RDF encoding.

[38]
datablock

=

*datareference | ( *datareference "<with><prefix" "NAME=" quoted-string [("DATASCHEMA=" | "DSCH=") quoted-string] [("OPTIONAL=" | "OPT=") quoted-string] ["TYPE=" quoted-string] [("TYPESCHEMA=" | "TSCH=") quoted-string] ["CATEGORY=" quoted-string] ["SHORT=" quoted-string]
["LONG=" quoted-string]
["TEMPLATE=" ("0" | "1") ">" datablock "</prefix></with"> datablock )

4.3.3 Client Side Writes

Services have the capacity to ask that information be written to the user repository. Consider a P3P agent receiving the following request as part of a proposal:

<statement ACTION="r" PURPOSE="2" ID="0"/>
    <ref NAME="Web.PUID"/>
    <ref NAME="User.PersonName.First"/>
</statement>
<statement ACTION="rw" PURPOSE="2" ID="0">
    <ref NAME="FineShoes.shoesize"
     DATASCHEMA="http://www.FineShoes.com/Schema1.0" />
</statement>

The user could be prompted, asked if she agrees to this proposal, and then shown a form like interface with the User.PersonName.First auto-filled (assuming she's entered that before) and a blank field for FineShoes.shoesize. The user would enter this information and transmit it (TXD).

The user sends:

<txd r="200" AGREEMENTID="94df1293a3e519bb" />
<ref NAME="User.PersonName.First" VALUE="Josephine"/>
<ref NAME="Web.PUID" VALUE="1234567"/>
<ref NAME="FineShoes.Shoesize" DATASCHEMA="http://www.FineShoes.com/Schema1.0" VALUE="7"/>
</txd>

Whenever the attribute "rw" is applied to a data element, and a new value is provided by the user, the user agent must attempt a write to the data repository. If that attempt fails, the agent should return the appropriate response code. This auto-write eliminates the need for the service to retransmit information back to the user agent. However, the service may send a (TXD) at any time. For instance, even though the user provided their shoesize as 7, the service wishes to convert it into European shoe sizes and write that value instead.

4.3.4 Unambiguous Optional Elements and Purposes

As described elsewhere, optional elements are denoted by the optional attribute of the <ref> tag. Since elements are part of a statement, they are associated with the purpose, actions, and qualifiers of that statement -- for brevity we shall only refer to the purpose in this section. Consequently, an optional element should be read as "this element is optional for the purposes declared in this statement." For instance, Web.PUID and User.Home.Info.First are collected for purpose 2 and 3 in non identifiable form:

<statement PURPOSE="2, 3" ID="0" RECIPIENT="0"
             CONSEQUENCE="a personalized site!"/>
    <ref NAME="Web.PUID"/>
    <ref NAME="User.Home.Info.First"/>
</statement>

In order to avoid ambiguity an element may only appear once in a proposal. If an element is desired (required or optional) for a single purpose its representation is unambiguous. That element only appears in one statement and (if optional) is returned if the user opts to do so.

The case in which a single element is required for one purpose but optional for other purposes is problematic. First, how does the service represent this state in its proposal? Second, how does the agent unambiguously represent the fact that it is providing the element, but for only one of the purposes? For instance, a service may wish to have shipping information for sending the user a product and (optionally) use it for contacting the user for marketing purposes.

A non-conformant proposal could be created where the element appears in one statement for the purpose of shipping, but appears optionally in another statement for marketing. However, the data transfer syntax does not support a method of returning the data and representing which of the required and optional purposes the user agreed to for a single element. This is why an element may only appear once in a proposal. Purposes which are optionally applied to an element that is required elsewhere MUST be represented with multiple proposals. When the constraint is followed, each agreementID as applied to the returned data is an unambiguous representation of what the user agreed to.

For example, if a service requires shipping information for completing the transaction and would like to optionally use all of this data for optionally for marketing purposes, it should send the following two proposals.

<?xml:namespace ns="http://www.w3.org/TR/1998/WD-P3P10-syntax#proposal.DTD" prefix="p3p"?> <?xml:namespace ns="http://www.w3.org/TR/WD-rdf-syntax#" prefix="RDF"?> <proposal AGREEMENTID="1e3a5d71297d104f>"
REALM="http://www.CoolCatalog.com/catalogue/"/>
<uses>
   <statement PURPOSE="2" ID="1"
    CONSEQUENCE="We can ship your order to your door step">
      <ref NAME="User.ShipTo."/>
   </statement> </uses>
<DISCLOSURE text="http://www.CoolCatalog.com/PrivacyPractice.html"
ACCESS="3" OTHERDISCLOSURE="0 1"/>
</proposal>
<proposal AGREEMENTID="2e3a5d71297d104f"
REALM="http://www.CoolCatalog.com/catalogue/" optional="1"> ; note optional is in header.
<uses>
   <statement PURPOSE="4" ID="1"
    CONSEQUENCE="We will send you our world-renown catalogue!">
      <ref NAME="User.ShipTo."/>
   </statement> </uses>
<DISCLOSURE text="http://www.CoolCatalog.com/PrivacyPractice.html"
ACCESS="3" OTHERDISCLOSURE="0 1"/>
</proposal>

Multiple proposals should be constructive so as to be additive and discrete. Proposals should not be constructed such that a user must choose between two proposals (A+B) and (A), where A and B represent policies. Rather, they should be presented with (A) and (B) and choose as many as appropriate.

In our example, if the user only agrees to the proposal for using the shipping information for the completion of the transaction, it would return the information under that agreement ID.

<txd r="200" AGREEMENTID="1e3a5d71297d104f" /> <with><prefix NAME="User.ShipTo.Info."> <ref NAME="User.Info.Posal.Name" VALUE="Josephine Hound"/> ... <ref NAME="Phone.Number" VALUE="617.555.7621"/> </prefix></with>

Note that this solution results in the transmission of potentially redundant information in multiple proposals and data transfers. However this is preferable to the greater complexity and latency of reaching the agreement if services can send unambiguous proposals and user agents must then return unambiguous representations of their agreement.

4.4 Creating New Data Sets

4.4.1 Data Definition

Within a proposal, new data elements and data sets can be created. This occurs when a reference to an unknown data element or data set is encountered within the proposal. In such an event, if the user agrees, the new data elements are created within the user's repository. In order to create a new data element, the agent needs the following: its name, the data schema it belongs to, the category, the data type (and, in case, the data schema where the type has been defined), and the short display name.

Also, a service may optionally provide a more detailed description of the data element in the form of a special schema-statement statement. This information can be provided in two ways:

inline within the <ref> tags, using the corresponding attributes category, type, typeschema, short (see the next subsection); or
outline, within the definition of the data schema provided by the dataschema attribute.

If information is inline and the origin server URI does not match the dataschema URI, the information in the main data schema MUST be checked in order to verify the consistency of the information. Otherwise, the information in the main data schema MAY be checked in order to verify consistency of the information. In case of a mismatch, a 530 return code (Invalid Format) must be returned. If for any other reason the user is unable to reconstruct the needed information, a 433 code (Data Unavailable) must be returned.

4.4.2 Data Schema Format

The format of a new data schema is a special statement of the form

<?xml:namespace ns="http://www.w3.org/TR/WD-P3P-syntax#" prefix="P3P"?>
<proposal><uses><statement>
  ....
<statement></uses><proposal>

A data block is enclosed within the <data> tags and contains references to the data elements. References can be made using the <ref> tag and its attributes:

NAME (to define the name of the data element)
CATEGORY (to define the category)
TYPE (to define the type)
TYPESCHEMA (to define the data schema where the type is defined)
SHORT (to define the short display name, no more than 30 characters.)
LONG (to provide a longer description of the data item, no more than 250 characters)

Like in the data transfer, the <prefix> tag can be used to shorten the schema definition, by associating a common prefix to many <ref> attributes.

For every data element, all of the information except LONG is mandatory. If one of the attributes is missing, it is presumed to be present with an empty string. In the case of the TYPESCHEMA, the empty string value has the special meaning that the type schema coincides with the value of the DATASCHEMA attribute.

For example, suppose the company HyperSpeed wants to build the following data schema:

car.model
car.color
car.built.year
car.built.where. (of basic type Postal.)
car.price
bike.model
bike.color
bike.built.year
bike.built.where. (of basic type Postal.)
bike.price

Then, it could just place the following code at http://www.HyperSpeed.com/models-schema

<?xml:namespace ns="http://www.w3.org/TR/WD-P3P-syntax#" prefix="P3P"?>
<proposal>
<uses>
<statement>
<with><prefix NAME="car." SHORT="Car">
 <ref NAME="model" TYPE="Text" SHORT="Model"\>
 <ref NAME="color" TYPE="Text" SHORT="Color"\>
 <with><prefix NAME="built." SHORT="Construction data">
  <ref NAME="year" TYPE="Number" SHORT="Year"\>
  <ref NAME="where." TYPE="Postal."\>
 </prefix></with>
</prefix></with>
<ref NAME="bike." TYPE="car." TYPESCHEMA="http://www.HyperSpeed.com/models-schema" SHORT="Bike "\>
</uses>
</statement>
</proposal>

Note that every time a data set is created, it can be implicitly used as a type, just like the car. case above. However, in some situations one may wish to define a type without creating a specific element within the user's repository. This can be accomplished by using the TEMPLATE attribute in the <ref> . Setting the value to yes, TEMPLATE="1" (default is "No", or 0), means that the corresponding data element is part of a type definition only, and is not actually representing a data element with an associated value. For example, HyperSpeed might want to define a GenericModel. type of general utility, and then instantiating it with car. and bike. This could be done with the following schema:

<?xml:namespace ns="http://www.w3.org/TR/WD-P3P-syntax#" prefix="P3P"?>
<proposal>
<uses>
<statement>
<with><prefix NAME="GenericModel." TEMPLATE="Yes">
 <ref NAME="model" TYPE="Text" SHORT="model"\>
 <ref NAME="color" TYPE="Text" SHORT="color"\>
 <with><prefix NAME="built." SHORT="Construction data">
  <ref NAME="year" TYPE="Number" SHORT="Year"\>
  <ref NAME="where." TYPE="Postal."\>
 </prefix></with>
</prefix></with>
<ref NAME="car." TYPE="GenericModel." TYPESCHEMA="http://www.HyperSpeed.com/models-schema" SHORT="Car"\>
<ref NAME="bike." TYPE="GenericModel." TYPESCHEMA="http://www.HyperSpeed.com/models-schema" SHORT="Bike"\>
</uses>
</statement>
</proposal>

5. Base Data Set and Data Types

5.1 Required (Base) Data Elements and Sets

The following are the base data elements and sets. These data elements and sets (as named) may not be modified or deleted by either the service or the user agent. However the user can provide any arbitrary value. For instance the agent may return no value or a value that differs depending on the active persona. We expect that in the future, there will be demand for the creation of other data sets and elements. Obvious applications include catalogue, payment, and agent/system attribute schemas. (An extensive set of system elements is provided in a white paper draft and are based on http://www.w3.org/TR/NOTE-agent-attributes.)

Each table below specifies a set, the elements within the set, the category associated with the element, its type, and the display name shown to users. More than one category may be associated with an element, particularly Web.Abstract. However, we have designed the base categories such that the base elements do no overlap or belong to more than one category when possible. We recommend that data schema designers do the same.

The Web. data set includes specific queries to the user agent.

Web. Category Type Short display name

PUID Unique Identifiers Number Pairwise or Site ID

TUID Unique Identifiers Number Temporary or Session ID

This data set includes elements used to uniquely identify a user to a single service. TUIDs are used for the purpose of retaining state for a given session. PUIDs are used to retain state over multiple sessions. Neither of these terms imply that users are personally identifiable nor that they are required to share a single UUID with multiple sites. Rather, each ID should be unique such that if both Jane's and Adam's user agents create an ID for interaction with the Cool.Com site, each of their IDs will be unique.

The Web.Abstract. data set serves the purpose of referencing HTTP data collection activity that is not collected through P3P methods.

Web.Abstract. Category Type Short display name

ClientClickStream Navigation and Click-stream Data, * Boolean Click-stream collected on the server.

ServerClickStream

Navigation and Click-stream Data, *
Boolean Click-stream collected on the client.

StoreNegotiation Transaction Data Boolean Server stores the negotiation history

Form * Boolean Data entered through forms

Form.SearchText Transaction Data Boolean Search terms

Unlike all other data elements that have values which are generated or stored by the agent, the WebAbstract elements do not have associated data values. Rather, they allow a service to declare the collection of data by means other than a request for a P3P data transfer. We call these elements abstract. When abstract elements appear within the same statement as other elements, this asserts that once the data is collected through the abstract method, the service will likely wish to read or write that information to the specific elements within the user's repository in the future.

These elements are often implicit in navigation or Web interactions. They should be used with categories to describe the type of information collected through these methods. Form includes information collection methods controlled by the service, such as HTML forms or Java(script) interfaces. Form.SearchText is a specific type of form solicitation used for searching and indexing sites. Consequently, if the only form fields on a search engine page are search fields, it need only disclose that data element.

The User. data set includes general information about the computer user.

User. Category Type Short display name

Name. Physical Contact Information, Demographic and SocioEconomic Data PersonName. User's Name

Bdate. Demographic and SocioEconomic Data Date. User's Birth Date

Cert Unique Identifiers Text User's Identity Certificate

Gender Demographic and SocioEconomic Data Gender User's Gender

Employer Demographic and SocioEconomic Data Text Name of user's employer

Department Demographic and SocioEconomic Data Text Department or division of organization where user is employed

JobTitle Demographic and SocioEconomic Data Text User's job title

Home. Physical Contact Information,
Online Contact Information, Demographic and SocioEconomic Data Info. User's Home Contact Information

Business. Physical Contact Information,
Online Contact Information, Demographic and SocioEconomic Data Info. User's Business Contact Information

BillTo. Physical Contact Information,
Online Contact Information, Demographic and SocioEconomic Data Info. User's Billing Address

ShipTo. Physical Contact Information,
Online Contact Information, Demographic and SocioEconomic Data Info. User's Shipping Address

Note, that this data set includes elements that are actually sets of data themselves. These sets are defined in the data types subsection of this document. The short display name for an individual element contained within a data set is defined as the concatenation of the short display names that have been defined for the set and the element, separated
by commas. For example, the short display name for User.Home.Postal.PostalCode would be "User's Home Contact Information, Postal Address Information, Postal Code". User agent implementations may prefer to develop their own short display names rather than using the concatenated names when prompting users for information.

5.2 Data Types

The Date. type is a structured type that specifies a date.

Date. Category Type Short display name

Year Demographic and SocioEconomic Data Number Year

Month Demographic and SocioEconomic Data Number Month

Day Demographic and SocioEconomic Data Number Day

Hour Demographic and SocioEconomic Data Number Hour

Minute Demographic and SocioEconomic Data Number Minute

Second Demographic and SocioEconomic Data Number Second

FractionSecond Demographic and SocioEconomic Data Number Fraction of Second

TimeZone Demographic and SocioEconomic Data Text Time Zone

The PersonName. type is a structured type that specifies information about the naming of a person.

PersonName. Category Type Short display name

Prefix Demographic and SocioEconomic Data Text Name Prefix

First Physical Contact Information Text First Name

Last Physical Contact Information Text Last Name

Middle Physical Contact Information Text Middle Name

Suffix Demographic and SocioEconomic Data Text Name Suffix

Nickname Demographic and SocioEconomic Data Text Nickname

The PhoneNum. type is a structured type that specifies the characteristics of a phone number.

PhoneNum. Category Type Short display name

IntCode Physical Contact Information Number International Phone Code

LocCode Physical Contact Information Number Local Phone Area Code

Number Physical Contact Information Number Phone Number

Ext Physical Contact Information Number Phone Extension

Comment Physical Contact Information Text Phone Optional Comments

The Info. type is a structured, redirected, type to other types. This is done so that requests for data need not collect information they do not need even if they wish to use the abbreviate set notation in a request.

Info. Category Type Short display name

Postal. Physical Contact Information, Demographic and
SocioEconomic Data Postal. Postal Address Information

Telecom. Physical Contact Information Telecom. Telecommunications Information

Online. Online Contact Information Online. Online Address Information

The Postal. type is a structured type that specifies a postal mailing address.

Postal. Category Type Short display name

Name. Physical Contact Information, Demographic and
SocioEconomic Data PersonName. Name

Street Physical Contact Information Text Street Address

City Physical Contact Information Text City

StateProv Physical Contact Information Text State or Province

PostalCode Physical Contact Information Text Postal Code

CountryCode Physical Contact Information Country Country

Country Physical Contact Information Text Country String

The Telecom. type is a structured type that specifies telecommunication information about a person.

Telecom. Category Type Short display name

Phone Physical Contact Information PhoneNum. Phone Number

Fax Physical Contact Information PhoneNum. Fax Number

Mobile Physical Contact Information PhoneNum. Mobile Phone Number

Pager Physical Contact Information PhoneNum. Pager Number

The Online. type is a structured type that specifies telecommunication information about a person.

Online. Category Type Short display name

Email Online Contact Information Text Email Address

URI Online Contact Information URI Home Page Address

This specification uses the following primitive data element datatypes, we reference specifications in keeping with XML1.0, HTTP/1.1, and XML-data as ordered.

Primitive DataType Definition

Text UTF-8

Gender Text of character "M" or "F".

Boolean Text of character "0" or "1".

Binary Base64 per RFC-1531. [MIME]

Number Text of characters composed with the digits "0", "1", "2", "3", "4", "5", "6", "7", "8", "9".

UUID [UUID]

Country ISO 1366

URI [URI]

5.3 Syntax of Data Elements and Sets

[1]

data-set

1*<the prefix ending with a dot of a data element>

Appendices

Appendix 1: References

[DSIG]: Y. Chu, P. DesAutels, B. LaMacchia, P. Lipp. "DSig 1.0 Signature Labels Specification: Using PICS 1.1 Labels for Making Signed Assertions," World Wide Web Consortium. 03-April-1998. (Recommendation)
[HARMV]: J. Reagle. "P3P Harmonized Vocabulary Specification," World Wide Web Consortium. 30-March-1998. (Working Draft)
[HTTP1.0]: T. Berners-Lee, R. Fielding, H. Frystyk, "RFC 1945 -- Hypertext Transfer Protocol -- HTTP/1.0," W3C/MIT, UC Irvine, W3C/MIT, May 1996.
[HTTP1.1]: R. Fielding, J. Gettys, J.C. Mogul, H. Frystyk, T. Berners-Lee, "RFC 2068 -- Hypertext Transfer Protocol -- HTTP/1.1," UC Irvine, Digital Equipment Corporation, MIT.
[ISO3166]: "ISO3166: Codes for The Representation of Names of Countries." International Organization for Standardization, December, 1993.
[MANDA]: H. Frystyk Nielsen, P. Leach, Scott Lawrence. "Mandatory Extensions in HTTP." January 1998. (draft-frystyk-http-mandatory).
[MD5]: R. Rivest. "RFC 1321 -- The MD5 Message-Digest Algorithm," MIT. April 1992.
[MIME]: N. Freed, N. Borenstein. "MIME (Multipurpose Internet Mail Extensions) Part One: Mechanisms for Specifying and Describing the Format of Internet Message Bodies." November 1996.
[NAMES]: T. Bray, D. Hollander, A. Layman. "Namespaces in XML." World Wide Web Consortium. 18-May-1998. (Working Draft)
[RDF]: O. Lassila, R. Swick. "Resource Description Framework (RDF) Model and Syntax Specification," World Wide Web Consortium. 16-February-1998. (Working Draft)
[SSL]: A. Freier, P. Karlton, P. Kocher. "SSL 3.0 Specification." http://home.netscape.com/eng/ssl3/index.html
[STATE]: Network Working Group, D. Kristol, Bell Laboratories, Lucent Technologies; Category: Standards Track HTTP State Management Mechanism. ftp://ftp.isi.edu/in-notes/rfc2109.txt.
[URI]: T. Berners-Lee, R. Fielding, and L. Masinter. "Uniform Resource Identifiers (URI): Generic Syntax and Semantics." 1997. (Work in progress; see updates to RFC1738.)
[UTF-8]: F. Yergeau "RFC 2279 -- UTF-8, a transformation format of ISO 10646." January 1998.
[UUID]: The Open Group. "Universal Unique Identifier. Appendix CAE Specification C706." DCE 1.1: Remote Procedure Call. 1997.
[VCARD]: "vCard - The Electronic Business Card Version 2.1." Internet Mail Consortium, September 18, 1996.
[XML]: T. Bray, J. Paoli, C. M. Sperberg-McQueen. "Extensible Markup Language (XML) 1.0 Specification," World Wide Web Consortium." 10-February-1998. (Recommendation)

Appendix 2: Line-flow Scenario

This scenario is based on scenarios 1/5 and 3 and includes negotiation.

A user directs their Web browser to go to home page of CoolCatalog. She has never been there before; she requests a P3P proposal as part of the request by advertising that her browser understands P3P.

GET / HTTP/1.1
Accept: */*
User-Agent: BugblatterBeast/3.02 (RT-11; English)
Host: www.CoolCatalog.com
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-42

CoolCatalog sends a proposal, including privacy practices, disclosures, and the data elements to which they apply. In this instance, the service asks to be given the PUID as part of the returnID automatically in the future and requests the user's gender from the user repository.

HTTP/1.1 409 Agreement required
Server: Marvin/2.0.1
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1944
1944-P3P1.0: <P3P><?xml:namespace ns=":http//www.w3.org/TR/1998/WD-P3P10-syntax#proposal.DTD" prefix="p3p"?>
<proposal AGREEMENTID="94df1293a3e519bb"
 REALM="http://www.CoolCatalog.com/" ENTITY="CoolCatalog" 
 ASSURANCE="http://www.TrustUS.org" />
  <uses>
  <statement PURPOSE="2" ID="0" RECIPIENT="0" CONSEQUENCE="a personalized site!"/> 
    <ref NAME="Web.PUID"/>
    <ref NAME="User.Gender"/>
  </statement>
  </uses>
  <DISCLOSURE text="http://www.CoolCatalog.com/PrivacyPractice.html" 
   ACCESS="3" OTHERDISCLOSURE="0 1"/>
</proposal></P3P>
Content-Type: text/html
Content-Length: 110

<html><body>
<h1>HTTP/1.1 409 Agreement Required</h1>
<p>We need an agreement before we can continue.</p>
</body></html>

[Note that the proposal is split across multiple lines for readability; over the network, a CRLF pair would be added only after the </proposal>. We will use this in the example lineflows below as well.]

The user refuses this proposal. Imagine that the agreementID (hash) of the above proposal is "e3a5d71297d104f1"; the user's next HTTP request would look like the following:

GET / HTTP/1.1
Accept: */*
User-Agent: BugblatterBeast/3.02 (RT-11; English)
Host: www.CoolCatalog.com
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1776
1776-P3P: <sry r="432" AGREEMENTID="e3a5d71297d104f1" />

The site offers a new proposal, this time it requests the automatic returnID, and from the user's repository the first name and optional age, both for the purposes of customizing the site.

HTTP/1.1 409 Agreement required
Server: Marvin/2.0.1
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1492
1492-P3P1.0: <P3P><?xml:namespace ns=":http//www.w3.org/TR/1998/WD-P3P10-syntax#proposal.DTD" prefix="p3p"?><proposal REALM="http://www.CoolCatalog.com/" ENTITY="CoolCatalog" 
 ASSURANCE="http://www.TrustUs.org"/>
  <uses>
  <statement PURPOSE="2" ID="0" CONSEQUENCE="a personalized site!">
    <with><prefix NAME="User.">
      <ref NAME="Name.First"/>
      <ref NAME="Bdate.Year" optional="1"/>
    </prefix></with>
    <ref NAME="Web.PUID"/>
  </statement>
  </uses>
  <DISCLOSURE text="http://www.CoolCatalog.com/PrivacyPractice.html" 
  ACCESS="3" OTHERDISCLOSURE="0 1"/>
</proposal></P3P>
Content-Type: text/html
Content-Length: 110

<html><body>
<h1>HTTP/1.1 409 Agreement Required</h1>
<p>We need an agreement before we can continue.</p>
</body></html>

The user agrees only to the provision of the automatic returnID and first name for customization only, and sends the data. Assume that the agreementID of the above proposal is "94df1293a3e519bb".

GET / HTTP/1.1
Accept: */*
User-Agent: BugblatterBeast/3.02 (RT-11; English)
Host: www.CoolCatalog.com
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1861
1861-P3P1.0: <P3P><txd r="200" AGREEMENTID="94df1293a3e519bb" >
 <ref PURPOSE="2" NAME="User.Name.First" VALUE="Josephine"/>
 <ref PURPOSE="2" NAME="User.PUID" VALUE="1234567"/>
</txd></P3P>

At this point, the service would return the CoolCatalog homepage.

HTTP/1.1 200 OK 
Server: Marvin/2.0.1
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1950
1950-P3P: <ok DH="s24fd20kuqexg5xk" />
Content-Type: text/html
Content-Length: xxx
[content of the CoolCatalog homepage]

The user returns to the site a day later, sending the returnID automatically.

GET / HTTP/1.1
Accept: */*
User-Agent: BugblatterBeast/3.02 (RT-11; English)
Host: www.CoolCatalog.com
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-2001
2001-P3P: <txd AGREEMENTID="94df1293a3e519bb" >
 <ref NAME="User.PUID" VALUE="1234567"/>
</txd>

However, the server again needs the user's first name, so it asks for it under the previous agreement.

HTTP/1.1 409 Agreement required
Server: Marvin/2.0.1
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-2010
2010-P3P: <proposal AGREEMENTID="94df1293a3e519bb"/>
</proposal>
Content-Type: text/html
Content-Length: 70

<html><body>
<h1>HTTP/1.1 400 Agreement Required</h1>
</body></html>

The user returns their first name and PUID.

GET / HTTP/1.1
Accept: */*
User-Agent: BugblatterBeast/3.02 (RT-11; English)
Host: www.CoolCatalog.com
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1968
1968-P3P1.0: <P3P><txd AGREEMENTID="94df1293a3e519bb" >
 <ref NAME="User.Name.First" VALUE="Josephine"/>
 <ref NAME="User.PUID" VALUE="1234567"/>
</txd></P3P>

The service would then respond with their homepage, as before:

HTTP/1.1 200 OK 
Server: Marvin/2.0.1
OPT: "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD"; ns-1950
1950-P3P1.0: <ok DH="s24fd20kuqexg5xk" />
Content-Type: text/html
Content-Length: xxx
[content of the CoolCatalog homepage]

Appendix 3: DTDs

The DTD's provided in the Appendix are provided for showing an example of the name space in use, and as a non-normative exercise in ensuring the robustness of the data structures. However, it is known to be incomplete. The normative notation for this specification is ABNF. The scope of the DTD's name space is provided through the following mechanism for each of the three sections below:

proposal.DTD

<!-- 
  Typical usage:

  <?XML version="1.0" encoding="UTF-8" ?>
  <!DOCTYPE proposal SYSTEM
    "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD">

  <proposal> ... </proposal>

-->

<!--======= Import TXD and Qualifier DTD =======-->


<!ENTITY % TXD PUBLIC
   "http://www.w3.org/TR/1998/WD-P3P10-harmonization#TXD.DTD">
%TXD;
<!ENTITY % qualifier PUBLIC
   "http://www.w3.org/TR/1998/WD-P3P10-harmonization#qualifier.DTD">
%qualifier;


<!--======= Internal Entities for Terseness =======-->

<!ENTITY % proposal-schema
  "http://www.w3.org/TR/1998/WD-P3P10-harmonization#Proposal.DTD">
<!--======= The Proposal =======-->

<!ELEMENT proposal (uses,disclosure) >
<!ATTLIST proposal
    SCHEMA CDATA %proposal-schema
    REALM        CDATA             #REQUIRED <!-- URIs -->
    ASSURANCE    CDATA             #REQUIRED <!-- URI -->
    ENTITY       CDATA             #REQUIRED
    OPTIONAL     ("0" | "1")       "0"
>
<!ELEMENT uses (statement+) >
<!ELEMENT statement (qualifier,data+) >
<!ATTLIST statement
    ACTION        ("r" | "rw" )     "r"
    CONSEQUENCE   CDATA             #IMPLIED
>
<!ELEMENT disclosure EMPTY>
<!ATTLIST disclosure
    DISCLOSURE    CDATA             #REQUIRED
    ACCESS   (    ("0" | "Identifiable Data is Not Used") |
                  ("1" | "Identifiable Contact Information") |
                  ("2" | "Other Identifiable Information") |
                  ("3" | "None") )                                   "0"
    OTHER    (("0" | "change_agreement") | ("1" | "retention"))      #IMPLIED

Qualifier DTD [May-13]

<!ELEMENT qualifier EMPTY>
<!ATTLIST qualifier
    purpose  (  ("0" | "Completion and Support of Current Activity") |
                ("1" | "Web Site and System Administration") |
                ("2" | "Customization of the Site to Individuals") |
                ("3" | "Research and Development") |
                ("4" | "Contacting Visitors for Marketing of 
                        Services or Products") | 
                ("5" | ("Other Uses" quoted-string) )                "0"
    recipient ( ("0" | "Only Ourselves and our Agents") |
                ("1" | "Organizations Following Our Practices") |
                ("2" | "Organizations Following Different Practices") |
                ("3" | "Unrelated Third Parties or Public Fora") )   "0"
    id          ("0" | "1")                                          "0"  
>

TXD DTD [May-13]

<!ELEMENT data (prefix,ref+)>
<!ELEMENT prefix EMPTY>
<!ATTLIST prefix
    name     CDATA    #REQUIRED
    schema   CDATA    #IMPLIED     <!-- URI -->
>
<!ELEMENT ref EMPTY>
<!ATTLIST ref
    name     CDATA            #IMPLIED
    optional ("0" | "1")      "0"
>

Appendix 4: Fingerprints and Canonicalization

P3P uses fingerprints (Message Digest, Hash) as a means of identifying proposals and data sets. These fingerprints yield a fixed size value resulting from the application of a hash function to a set of data. Fingerprints are easily computed, but even small modifications to the original information result in a different fingerprint value. Consequently, fingerprints are often used as methods of assuring information integrity. We use the MD5 digest algorithm for P3P fingerprints. This yields a 128-bit value which will be encoded and exchanged in P3P communications. Both agent and service can use these fingerprints to reference a given proposal and to ensure the integrity of the transaction (validating proposal against fingerprint).

4.1 Generating a fingerprint

The process for generating a fingerprint is fairly straight forward. For fingerprints to be identical, the data fingerprinted by sender and recipient must be identical. Our goal is to render the data to be fingerprinted into a standard form (canonicalized) such that the effect of any reformatting by intermediaries is nullified. After this is done, we use the canonicalized version of the proposal to generate the fingerprint. Given the same proposal, both the service and agent should be able to arrive at the same fingerprint value.

Proposal cannonicalization rules

the first element in a proposal is the <proposal...> element
all elements begin with the "<" character, all leading spaces must be removed.
there must be no space between the "<" and the element name
- for opening elements the element name, in lower case immediately follows the left bracket "<"
- for closing elements, the left bracket is immediately followed by right slash "/" and then the element name in lower case.
for elements with attributes
- one space is inserted before an attribute name
- attribute names must be uppercase followed immediately by one equals symbol "="and then immediately by the quoted attribute data, the attribute data is taken as literal and must not be reformatted or altered in any way
if present, the closing right slash symbol "/" must immediately follow the final quote in attribute data
all elements must be closed by a right bracket ">" immediately following the right slash "/" or the quoted attribute data
all right brackets ">" must be followed by CRLF
the proposal endes with the </proposal> element

As an example, consider the following XML:

<rdf:rdf><proposal AGREEMENTID="94df1293a3e519bb" REALM="http://www.CoolCatalog.com/catalogue/" ENTITY="CoolCatalog" ASSURANCE="http://www.TrustUs.org"> <uses> <statement CONSEQUENCE="a site with clothes you'd appreciate." PURPOSE="2, 3" RECIPIENT="0" ID="0"> <with><prefix NAME="User."> <ref NAME="Name.First"/> <ref NAME="Bdate.Year" OPTIONAL="1"/> <ref NAME="Gender"/> </prefix></with> </statement> </uses> <uses> <statement ACTION="read&write" PURPOSE="0" RECIPIENT="0" ID="1"> <ref NAME="User.Shipping."/> </statement> </uses> <disclosure TEXT="http://www.CoolCatalog.com/PrivacyPractice.html" ACCESS="3" OTHERDISCLOSURE="0 1"/> </proposal></rdf:rdf>

After canonicalization, it would look like:

<proposal AGREEMENTID="94df1293a3e519bb" REALM="http://www.CoolCatalog.com/catalogue/" ENTITY="CoolCatalog" ASSURANCE="http://www.TrustUs.org"> <uses> <statement CONSEQUENCE="a site with clothes you'd appreciate." PURPOSE="2, 3" RECIPIENT="0" ID="0"> <with> <prefix NAME="User."> <ref NAME="Name.First"/> <ref NAME="Bdate.Year" OPTIONAL="1"/> <ref NAME="Gender"/> </prefix> </with> </statement> </uses> <uses> <statement ACTION="read&write" PURPOSE="0" RECIPIENT="0" ID="1"> <ref NAME="User.Shipping."/> </statement> </uses> <disclosure TEXT="http://www.CoolCatalog.com/PrivacyPractice.html" ACCESS="3" OTHERDISCLOSURE="0 1"/> </proposal>

Generating the MD5 fingerprint

Once the proposal is canonicalized, we use the MD5 algorithm to generate the P3P fingerprint. This section provides an overview of MD5 and an explanation of how a MD5 digest is encoded.

MD5 Overview

The MD5 hash algorithm is defined in

The MD5 Message Digest Algorithm, R.L. Rivest, RFC 1321, April 1992

The MD5 algorithm (Message Digest 5) is a cryptographic message digest algorithm.

MD5 was designed by Ron Rivest, who is also the `R' in `RSA' in 1991. MD5 is described in RFC1321. C source code is included with the RFC. It is basically MD4 with "safety-belts" and while it is slightly slower than MD4, it is more secure. The algorithm consists of four distinct rounds, which have a slightly different design from that of MD4. Message-digest size, as well as padding requirements, remains the same. Den Boer and Bosselaers [B. den Boer and A. Bosselaers. Collisions for the compression function of MD5. In Advances in Cryptology - Eurocrypt '93, pages 293-304, Springer-Verlag, 1994.] have found pseudo-collisions for MD5 (see RSA FAQ Question 98), but there are no other known cryptanalytic results.

The MD5 algorithm takes as input a message of arbitrary length and produces as output a 128-bit "fingerprint" or "message digest" of the input. It is conjectured that it is computationally infeasible to produce two messages having the same message digest, or to produce any message having a given prespecified target message digest. The MD5 algorithm is intended for digital signature applications, where a large file must be "compressed" in a secure manner before being encrypted with a private (secret) key under a public-key cryptosystem such as RSA or PGP.

For further information on MD5, see:

P3P Encoding

Computing the MD5 digest yields 4 32-bit values. All values are encoded using the standard base-64 representation of a byte-array containing the two's-complement representation of the value to encode. The first byte in this array is the high-order byte. The minimal number of bytes necessary is used to represent a certain value, so no leading zero-bytes are allowed.

The BNF below shows how a MD5 digest is encoded for P3P communications.

resource-hash     ::= '"base64-string encoding of 128 bit MD5 message
                        digest of the information resource."'
base64-string     ::=as defined in RFC-1521, section 5.2.

Appendix 5: ABNF Notation

The formal grammar of P3P is given in this specification using a slight modification of the ABNF defined in http://info.internet.isi.edu/in-notes/rfc/files/rfc2234.txt . The following is a simple description of the ABNF.

name = (element): where <name> is the name of the rule, <elements> is one or more rule names or terminals combined through the operands provided below. Rule names are case-insensitive.
(element1 element2): elements enclosed in parentheses are treated as a single element, whose contents are strictly ordered.
<a>*<b>element: at least <a> and at most <b> occurrences of the element.; (1*4<element> means one to four elements.)
<a>element: exactly <a> occurrences of the element.; (4<element> means exactly 4 elements.)
<a>*element: <a> or more elements; (4*<element> means 4 or more elements.)
*<b>element: 0 to <b> elements.; (*5<element> means 0 to 5 elements.)
*element: 0 or more elements.; (*<element> means 0 to infinite elements.)
[element]: optional element, equivalent to *1(element).; ([element] means 0 or 1 element.)
"string" or 'string': matches a literal string matching that given inside the double quotes.

Other notations used in the productions are:

; or /* ... */: comment.

Appendix 6: Possible Future Features/Work

The issue of HTML forms is a complicated one. This capability is desired so that sites will have greater presentational control during the solicitation of information. However it complicates the P3P architecture, user experience, and introduces security/privacy weaknesses (information is now being solicited by an HTML form, which is less trust-worthy than the user's user agent). In the future, if one is willing to specify behavior for HTML parsing and presentation, one could specify requirements on the form FIELD such that if the value is equal to a corresponding element name in the proposal the HTML client could auto-fill it for the user. The post method would depend on whether a post-URI is present (if so, send XML formatted data, if not send CGI formatted data). The WG will provide more information on the relationship to forms in the next working draft.
HTML forms are only one specific instance of a more general issue, how does P3P relate to other protocols referenceable by a URI including FTP, channels, telnet, email, etc.
We need to properly resolve/coordinate requirements and dependencies on RDF syntax and XML name-spaces.

Appendix 7: Working Group Contributors

Lorrie Cranor	AT&T
Philip DesAutels	Matchlogic
Melissa Dunn	Microsoft
Daniel Jaye (Editor)	Engage Technologies
Kevin Jones	Intermind
Steve Lucas (Chair)	Matchlogic
Massimo Marchiori (Editor)	W3C
Maclen Marvit	Narrowline
Max Metral	Firefly Network Inc.
Paul Perry	Firefly Network Inc
Martin Presler-Marshall	IBM
Joseph Reagle (Project Manager)	W3C
Drummond Reed	Intermind

This work was based on other documents, specifically the

as well as the :

Scenario	Existing Agreement	New Proposal	PUID Requested	Data Requested	Data Written
1	No	Yes	Yes	-	-
2	Yes	No	No	-	-
3	Yes	Yes	Yes	-	-
4	-	-	-	Yes	No
5	-	-	-	Yes	Yes

Scenario	Proposal-URI	Post-URI	Proposal Embedded in Content
6	Yes	-	No
7	No	-	Yes
8	-	Yes	-

Message	Meaning	After Receiving	Expected Response	Data in Message	Optional in Message	HTTP Return Code
OK	Proposal acceptable or data transfer successful	PROP or TXD	none	MD5 hash of agreement or data transferred		2xx
PROP	Here's a Proposal	Any time	OK, SRY, or PROP	Text of a proposal	Signature of initiator, fingerprint of previous Proposal	409
SRY	Sorry - request not processed	PROP, TXD	PROP or none	Reason code and MD5 hash of proposal or data transferred	Which practices are unacceptable	409
TXD	Transfer Data	Any time	none, OK or SRY	Data element names and values to be written, as requested	Agreement	2xx

[22]	long-proposal	=	"<proposal" ("AGREEMENTID=") quoted-string ["AGREXP=" datetime] ;default is six months "REALM=" `"` URI (" " URI) `"` "ENTITY=" quoted-string [("ASSURANCE=") `"` URI `"`] ["postURI=" `"` URI `"`] ">" 1statement-block disclosure "</proposal>"
[23]	schemaURI	=	<"> "http://www.w3.org/TR/1998/WD-P3P10-harmonization#proposal.DTD" <">
[24]	quoted-string	=	<UTF-8 quoted string>
[25]	URI	=	<quoted URI as per RFC 2068 [URI]>
[26]	datetime	=	< date/time as per section 3.3 of RFC 2068>

Web.	Category	Type	Short display name
PUID	Unique Identifiers	Number	Pairwise or Site ID
TUID	Unique Identifiers	Number	Temporary or Session ID

Web.Abstract.	Category	Type	Short display name
ClientClickStream	Navigation and Click-stream Data, *	Boolean	Click-stream collected on the server.
ServerClickStream	Navigation and Click-stream Data, *	Boolean	Click-stream collected on the client.
StoreNegotiation	Transaction Data	Boolean	Server stores the negotiation history
Form	*	Boolean	Data entered through forms
Form.SearchText	Transaction Data	Boolean	Search terms

User.	Category	Type	Short display name
Name.	Physical Contact Information, Demographic and SocioEconomic Data	PersonName.	User's Name
Bdate.	Demographic and SocioEconomic Data	Date.	User's Birth Date
Cert	Unique Identifiers	Text	User's Identity Certificate
Gender	Demographic and SocioEconomic Data	Gender	User's Gender
Employer	Demographic and SocioEconomic Data	Text	Name of user's employer
Department	Demographic and SocioEconomic Data	Text	Department or division of organization where user is employed
JobTitle	Demographic and SocioEconomic Data	Text	User's job title
Home.	Physical Contact Information, Online Contact Information, Demographic and SocioEconomic Data	Info.	User's Home Contact Information
Business.	Physical Contact Information, Online Contact Information, Demographic and SocioEconomic Data	Info.	User's Business Contact Information
BillTo.	Physical Contact Information, Online Contact Information, Demographic and SocioEconomic Data	Info.	User's Billing Address
ShipTo.	Physical Contact Information, Online Contact Information, Demographic and SocioEconomic Data	Info.	User's Shipping Address

Date.	Category	Type	Short display name
Year	Demographic and SocioEconomic Data	Number	Year
Month	Demographic and SocioEconomic Data	Number	Month
Day	Demographic and SocioEconomic Data	Number	Day
Hour	Demographic and SocioEconomic Data	Number	Hour
Minute	Demographic and SocioEconomic Data	Number	Minute
Second	Demographic and SocioEconomic Data	Number	Second
FractionSecond	Demographic and SocioEconomic Data	Number	Fraction of Second
TimeZone	Demographic and SocioEconomic Data	Text	Time Zone

PersonName.	Category	Type	Short display name
Prefix	Demographic and SocioEconomic Data	Text	Name Prefix
First	Physical Contact Information	Text	First Name
Last	Physical Contact Information	Text	Last Name
Middle	Physical Contact Information	Text	Middle Name
Suffix	Demographic and SocioEconomic Data	Text	Name Suffix
Nickname	Demographic and SocioEconomic Data	Text	Nickname

PhoneNum.	Category	Type	Short display name
IntCode	Physical Contact Information	Number	International Phone Code
LocCode	Physical Contact Information	Number	Local Phone Area Code
Number	Physical Contact Information	Number	Phone Number
Ext	Physical Contact Information	Number	Phone Extension
Comment	Physical Contact Information	Text	Phone Optional Comments

Info.	Category	Type	Short display name
Postal.	Physical Contact Information, Demographic and SocioEconomic Data	Postal.	Postal Address Information
Telecom.	Physical Contact Information	Telecom.	Telecommunications Information
Online.	Online Contact Information	Online.	Online Address Information

Postal.	Category	Type	Short display name
Name.	Physical Contact Information, Demographic and SocioEconomic Data	PersonName.	Name
Street	Physical Contact Information	Text	Street Address
City	Physical Contact Information	Text	City
StateProv	Physical Contact Information	Text	State or Province
PostalCode	Physical Contact Information	Text	Postal Code
CountryCode	Physical Contact Information	Country	Country
Country	Physical Contact Information	Text	Country String

Telecom.	Category	Type	Short display name
Phone	Physical Contact Information	PhoneNum.	Phone Number
Fax	Physical Contact Information	PhoneNum.	Fax Number
Mobile	Physical Contact Information	PhoneNum.	Mobile Phone Number
Pager	Physical Contact Information	PhoneNum.	Pager Number

Online.	Category	Type	Short display name
Email	Online Contact Information	Text	Email Address
URI	Online Contact Information	URI	Home Page Address

Primitive DataType	Definition
Text	UTF-8
Gender	Text of character "M" or "F".
Boolean	Text of character "0" or "1".
Binary	Base64 per RFC-1531. [MIME]
Number	Text of characters composed with the digits "0", "1", "2", "3", "4", "5", "6", "7", "8", "9".
UUID	[UUID]
Country	ISO 1366
URI	[URI]