File:The Wikidata Query Service Split and its Impact on the Scholarly Graph.pdf

Original file (1,239 × 1,752 pixels, file size: 2.44 MB, MIME type: application/pdf, 5 pages)

Summary

Description
English: Wikidata, the open knowledge graph sister to Wikipedia, is undergoing major changes in 2025 as the Wikimedia Foundation splits its data into two graphs. One of the split pieces is essentially the data of the WikiCite project, an initiative expanding the use of Wikidata as a platform for scholarly information. Due to WikiCite’s success, it accounts for over 50% of the triples on the Wikidata graph. This split was motivated by challenges in scaling up the Wikidata SPARQL Query Service, which relies on Blazegraph, a technology unsuited for Wikidata’s current scale and growth rate. While the split has been prepared since 2021, this large infrastructure change has wide implications both for community tools that use WikiCite data, such as the Scholia platform, and for core functionalities of Wikidata, such as systems that detect duplicate items and constraint violations. In this paper, we present an overview of the infrastructure available on Wikidata for querying scholarly information and how it serves the community endeavours related to the WikiCite initiative. In particular, we focus on the Wikidata Query Service split, its motivations, and its impacts on those intending to use Wikidata as a source of semantic scholarly data. We present the alternatives for rewriting or redirecting broken queries, making explicit the rules of the graph split and the cases in which federated queries are now required, to guide stakeholders in adapting to the new infrastructure.
Date
Source

Tiago Lubiana, Lane Rasberry, Daniel Mietchen (2025). The Wikidata Query Service split and its impact on the scholarly graph. In: Joint Proceedings of Posters, Demos, Workshops, and Tutorials of the 21st International Conference on Semantic Systems, co-located with 21st International Conference on Semantic Systems (SEMANTiCS 2025), Vienna, Austria, September 3-5, 2025. Edited by David Chaves-Fraga, Ivan Heibi, Daniel Garijo, Diego Collarana, Angelo Salatino, Sahar Vahdati.

Available via https://ceur-ws.org/Vol-4064/PD-paper3.pdf.
Author
Tiago Lubiana, Lane Rasberry, Daniel Mietchen (2025)
Other versions
File:The Wikidata Query Service split and its impact on the scholarly graph - poster at the SEMANTICS 2025 conference.pdf
The poster described in the article

Licensing

This file is licensed under the Creative Commons Attribution 4.0 International license.
You are free:
  • to share – to copy, distribute and transmit the work
  • to remix – to adapt the work
Under the following conditions:
  • attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

Captions

An overview of the Wikidata Query Service split

10 October 2025

Data size: 2,559,494 bytes

Height: 1,752 pixels

Width: 1,239 pixels

Media type: application/pdf

SHA-1 checksum: 4caaa845fa3aee222e7489f3ecd29295ca41997e

File history


Date/Time: 05:52, 13 December 2025 (current)
Dimensions: 1,239 × 1,752, 5 pages (2.44 MB)
User: Daniel Mietchen
Comment: Uploaded a work by Tiago Lubiana, Lane Rasberry, Daniel Mietchen (2025) from Tiago Lubiana, Lane Rasberry, Daniel Mietchen (2025). The Wikidata Query Service split and its impact on the scholarly graph. In: Joint Proceedings of Posters, Demos, Workshops, and Tutorials of the 21st International Conference on Semantic Systems, co-located with 21st International Conference on Semantic Systems (SEMANTiCS 2025), Vienna, Austria, September 3-5, 2025. Available via [https://ceur-ws.org/Vol-4064/PD-p...
