Semantic Conventions for URL

Status: Experimental

This document defines semantic conventions that describe URL and its components.

Attributes

AttributeTypeDescriptionExamplesRequirement LevelStability
url.fragmentstringThe URI fragment componentSemConvRecommendedStable
url.fullstringAbsolute URL describing a network resource according to RFC3986 [1]https://www.foo.bar/search?q=OpenTelemetry#SemConv; //localhostRecommendedStable
url.pathstringThe URI path component [2]/searchRecommendedStable
url.querystringThe URI query component [3]q=OpenTelemetryRecommendedStable
url.schemestringThe URI scheme component identifying the used protocol.https; ftp; telnetRecommendedStable

[1]: For network calls, URL usually has scheme://host[:port][path][?query][#fragment] format, where the fragment is not transmitted over HTTP, but if it is known, it SHOULD be included nevertheless. url.full MUST NOT contain credentials passed via URL in form of https://username:password@www.example.com/. In such case username and password SHOULD be redacted and attribute’s value SHOULD be https://REDACTED:REDACTED@www.example.com/. url.full SHOULD capture the absolute URL when it is available (or can be reconstructed). Sensitive content provided in url.full SHOULD be scrubbed when instrumentations can identify it.

[2]: Sensitive content provided in url.path SHOULD be scrubbed when instrumentations can identify it.

[3]: Sensitive content provided in url.query SHOULD be scrubbed when instrumentations can identify it.

Sensitive information

Capturing URL and its components MAY impose security risk. User and password information, when they are provided in User Information subcomponent, MUST NOT be recorded.

Instrumentations that are aware of specific sensitive query string parameters MUST scrub their values before capturing url.query attribute. For example, native instrumentation of a client library that passes credentials or user location in URL, must scrub corresponding properties.

Note: Applications and telemetry consumers should scrub sensitive information from URL attributes on collected telemetry. In systems unable to identify sensitive information, certain attribute values may be redacted entirely.