CDX Internet Archive Index

Name: CDX Internet Archive Index


Description: A CDX file consists of individual lines of text, each of which summarizes a single web document. The first line in the file is a legend for interpreting the data, and the following lines contain the data for referencing the corresponding pages within the host. The first character of the file is the field delimiter used in the rest of the file. This is followed by the literal "CDX". For signature strength we currently assume the field delimiter will be a space character, however please contact the PRONOM team should you encounter CDX index files where the delimiter is different.

Deprecated: false


PUID: fmt/869

sameAs : PRONOM:

Extension: cdx

Magic: true

Container Magic: false

Binary Magic: true

Signature Priority Over:

See Also (e.g. Wikidata, Library of Congress):

Software that can read the format: