Scholarly metadata, deposited by thousands of our members and made openly available can act as “trust signals” for the publications. It provides information that helps others in the community to verify and assess the integrity of the work. Despite having a central responsibility in ensuring the integrity of the work that they publish, editorial teams tend not be fully aware of the value of metadata for integrity of the scholarly record. How can we change that?
Crossref was created back in 2000 by 12 forward-thinking scholarly publishers from North America and Europe, and by 2002, these members had registered 4 million DOI records. At the time of writing, we have over 23,600 members in 164 different countries. Half of our members are based in Asia, and 35% are universities or scholar-led. These members have registered over 176 million open metadata records with DOIs (as of today). What a difference 25 years makes!
In our 25th anniversary year, I thought it would be time to take a look at how we got here. And so—hold tight—we’re going to go on an adventure through space and time1, stopping every 5 years through Crossref history to check in on our members. And we’re going to see some really interesting changes over the years.
The Frankfurt Book Fair is the largest book fair in the world, and therefore a key event on our calendar. Held annually in Frankfurt, Germany, the 77th Frankfurt Book Fair (October 15–19, 2025) saw 118,000 trade visitors and 120,000 private visitors from 131 countries. The Crossref booth was located, as usual, in Hall 4.0 where all the stands with information about academic publishing can be found. Four Crossref colleagues attended the Book Fair this year, and in this blog post, you can read more about their meetings, experiences, and plans.Â
TL;DR. Metadata Manager will be retired at the end of 2025. Over the past four years, we have been developing a new helper tool to replace it, and that tool has now reached a stage of maturity that means we will be able to switch off Metadata Manager by the end of the year.
Many researchers want to carry out analysis and extraction of information from large sets of data, such as journal articles and other scholarly content. Methods such as screen-scraping are error-prone, place too much strain on content sites and may be unrepeatable or break if site layouts change. Providing researchers with automated access to the full-text content via DOIs and Crossref metadata reduces these problems, allowing for easy deduplication and reproducibility. Supporting text and data mining echoes our mission to make research outputs easy to find, cite, link, assess, and reuse.
In 2013 Crossref embarked on a project to better support Crossref members and researchers with Text and Data Mining requests and access. There were two main parts to the project:
To collect and make available full-text links and publisher TDM license links in the metadata.
To provide a service (TDM click-through service) for Crossref members to post their additional TDM terms and conditions and for researchers to access, review and accept these terms.
To date, 37.5 million works registered with Crossref have both full-text links and TDM license information. We continue to encourage all members to include full-text links and license information in the metadata they register to assist researchers with TDM. You can see how each member is doing via its Participation Report (e.g. Wiley’s).
Members are also making subscription content available for text mining (temporarily or otherwise) for specific purposes, such as to help the research community with its response to COVID-19. Back in April we highlighted how this can be achieved by including:
A “free to read” element in the access indicators section of publisher metadata indicating that the content is being made available free-of-charge (gratis)
An assertion element indicating that the content being made available is available free-of-charge.
To access Crossref’s click-through tool for text and data mining, users could log in via their ORCID iD. They could then review TDM license agreements posted by Crossref members and accept, reject or postpone their decisions until later. Having agreed to a publisher’s terms and conditions this action was logged against the user’s API token which they could use when requesting full-text from the publisher.
Since the pilot in 2014, only 2 publishers have continued with the tool and fewer than 300 API tokens have been issued.
Publishers have since developed their own mechanisms for managing TDM requests. The introduction of UK (2014) / EU (2019) copyright exceptions for TDM has significantly reduced the number of requests and at the same time, more and more content is published under an open access license.
Given the low take-up of the click-through by both publishers and researchers, its goals are no longer being met. Therefore we will retire the TDM click-through in December 2020. Until that date, it will still operate for the two publishers and various researchers who use it while they finish implementing their alternative plans.
Crossref will continue to collect member-supplied TDM licensing information in metadata for individual works, and researchers can continue to find this via the Crossref APIs.