Documentation

Accessing full texts

For some purposes, such as text and data mining (TDM), you may want to access the full text of a content item. We only collect metadata, but this metadata often contains a URL to access full-text content along with the intended purpose for the URL and the license under which content can be used. Note that full-text URLs may differ from the landing page for the DOI, which often contains only basic information and an abstract.

Examples

Get a single record from the REST API:

https://api-crossref-org.ezproxy.galter.northwestern.edu/v1/works/10.1007/978-3-031-23161-2_300726

Get a list of records, requesting the DOI and any available links:

https://api-crossref-org.ezproxy.galter.northwestern.edu/v1/works?rows=5&select=DOI,link

These results are in JSON format and contain a link field, e.g.

"link": [
  {
    "URL": "https://api-elsevier-com.ezproxy.galter.northwestern.edu/content/article/PII:B9780081026960000208?httpAccept=text/xml",
    "content-type": "text/xml",
    "content-version": "vor",
    "intended-application": "text-mining"
  },
  {
    "URL": "https://api-elsevier-com.ezproxy.galter.northwestern.edu/content/article/PII:B9780081026960000208?httpAccept=text/plain",
    "content-type": "text/plain",
    "content-version": "vor",
    "intended-application": "text-mining"
  }

To retrieve a single record from the XML API:

https://doi-crossref-org.ezproxy.galter.northwestern.edu/search/doi?pid=email@address.com&doi=10.1577/H02-043&format=unixsd

There is a program field with the attribute name="AccessIndicators" containing license information:

<ai:license_ref applies_to="tdm">
  https://www-springernature-com.ezproxy.galter.northwestern.edu/gp/researchers/text-and-data-mining
</ai:license_ref>
<ai:license_ref applies_to="vor">
  https://www-springernature-com.ezproxy.galter.northwestern.edu/gp/researchers/text-and-data-mining
</ai:license_ref>

And a further section collection with a property attribute corresponding to the use. The value of this field is a link to the full text, e.g.:

  <item>
    <resource mime_type="application/pdf">
https://link-springer-com.ezproxy.galter.northwestern.edu/content/pdf/10.1007/978-3-031-23161-2.pdf
    </resource>
  </item>
</collection>

Access and rate limits

Note that the presence of a URL in one of these fields does not guarantee access. You might need a subscription or a login to access the full text content, or you might need to accept a text and data mining license. Such requirements are at the discretion of the Crossref member and are not controlled by Crossref.

Rate limits may apply to websites on which the full text is hosted, and these do not correspond to rate limits on Crossref APIs and websites. We have defined a set of standard HTTPS headers that can be used by servers hosting full-text content to convey rate-limiting information to automated text and data mining tools:

Header nameExplanation
CR-TDM-Rate-LimitMaximum number of full-text downloads that are allowed to be performed in the defined rate limit window.
CR-TDM-Rate-Limit-RemainingNumber of downloads left for the current rate limit window.
CR-TDM-Rate-Limit-ResetRemaining time (in UTC epoch seconds) before the rate limit resets and a new rate limit window is started.

Page maintainer: Martyn Rittman
Last updated: 2025-October-17