A digital library and cyberinfrastructure facilitating the discovery and utilization of data & knowledge in published documents

>12,500,000 documents

~200,000 added this month

~50,000 added this week

~8,000 added in the last 24 hours

Enabling TDM

In collaboration with our UW Library staff team members, xDD negotiates agreements with publishers that allow programatic downloading and mining of published content.

All documents are securely stored on an access-controlled server at the heart of our digital library infrastructure (xDD team members and our collaborators do not have access to original content via our infrastructure). UW-Madison's Center for High Throughput Computing supplies the computational power for processing documents using NLP, OCR, and other software tools useful for TDM tasks, which also allows for deploying new tools quickly against all existing documents.

News

Are Researchers Citing Their Data? A Case Study from The U.S. Geological Survey Grace Donovan and Madison Langseth. April 2024. https://datascience.codata.org/articles/10.5334/dsj-2024-024
DARPA Selects Teams to Improve How Scientists Build/Sustain Models, Simulations. OUTREACH@DARPA.MIL. September 2022. https://www.darpa.mil/news-events/2022-09-23
Automated text and data mining: knowledge base creation and augmentation. Shanan Peters. June 2018. https://youtu.be/wzGKFS4IefI?t=8801
What kind of discoveries might be hidden in the growing sea of ‘dark data’? GeoDeepDive might be able to tell us. EarthCube blog. March 2018. https://earthcube.blog/2018/03/06/geodeepdive-into-darkdata/
A New Tool for Deep-Down Data Mining. Eos. 22 Sept. 2017. https://eos.org/project-updates/a-new-tool-for-deep-down-data-mining
UW Digital Humanities Research Network Podcast. Fall 2017. http://dhrn.wiscprintdigital.org/uw-dh-archive/geodeepdive/
Massive, computer-analyzed geological database reveals chemistry of ancient ocean. University of Wisconsin - Madison News. March 2017. https://news.wisc.edu/massive-computer-analyzed-geological-database-reveals-chemistry-of-ancient-ocean/
Scientists search 3 million publications to unlock sea change secret. Engadget. March 2017. https://www.engadget.com/2017/03/30/scientists-search-3-million-publications-to-unlock-sea-change-se/
Computers read the fossil record. Nature News. June 2015. https://www.nature.com/news/computers-read-the-fossil-record-1.17868
Computer equal to or better than humans at cataloging science. University of Wisconsin - Madison News. December 2014. https://news.wisc.edu/computer-equal-to-or-better-than-humans-at-cataloging-science/

End-user Workflow

Have an idea

A question that can be answered by mining the scientific literature. Over 13 million documents are currently available.

Explore the xDD API

Generate summaries of journal coverage, search for and analyze relevant terms from document full text.

Collaborate

We are always looking to support creative new work and to form new collaborations on projects that lead to new scholarly work. All xDD output is licensed under CC-BY-NC.

Get in touch

Whether you're a publisher interested in contributing your content to our infrastructure, a scientist interested in collaboration, or just curious to know more, let us know!

xdd@cs.wisc.edu

The Team

The xDD team is based at the University of Wisconsin - Madison and is made up of domain experts in both the Geosciences and Computer Sciences, librarians, infrastructure developers, with participation of undergraduate, graduate, and postdoctoral researchers