Wikidata

From Bhikitia Pro, An open encyclopedia
Revision as of 00:09, 28 December 2024 by Duyel (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a common source of open data that Wikimedia projects such as Wikipedia, and anyone else, is able to use under the CC0 public domain license. Wikidata is a wiki powered by the software MediaWiki, including its extension for semi-structured data, the Wikibase. As of mid-2024, Wikidata had 1.57 billion item statements (semantic triple).[1][2][3][4]

Concept

Wikidata is a document-oriented database, focusing on items, which represent any kind of topic, concept, or object. Each item is allocated a unique, persistent identifier, a positive integer prefixed with the upper-case letter Q, known as a "QID". Q is the starting letter of the first name of Qamarniso Vrandečić (née Ismoilova), an Uzbek Wikimedian married to the Wikidata co-developer Denny Vrandečić.[5] This enables the basic information required to identify the topic that the item covers to be translated without favouring any language.[6]

Statements

Statements are how any information known about an item is recorded in Wikidata. Formally, they consist of key–value pairs, which match a property (such as "author", or "publication date") with one or more entity values (such as "Sir Arthur Conan Doyle" or "1902"). For example, the informal English statement "milk is white" would be encoded by a statement pairing the property color (P462) with the value white (Q23444) under the item milk (Q8495).

Properties

Each property has a numeric identifier prefixed with a capital P and a page on Wikidata with optional label, description, aliases, and statements. As such, there are properties with the sole purpose of describing other properties, such as subproperty of (P1647).

Properties may also define more complex rules about their intended usage, termed constraints. For example, the capital (P36) property includes a "single value constraint", reflecting the reality that (typically) territories have only one capital city. Constraints are treated as testing alerts and hints, rather than inviolable rules.

Lexemes

In linguistics, a lexeme is a unit of lexical meaning representing a group of words that share the same core meaning and grammatical characteristics.Similarly, Wikidata's lexemes are items with a structure that makes them more suitable to store lexicographical data. Since 2016, Wikidata has supported lexicographical entries in the form of lexemes.

In Wikidata, lexicographical entries have a different identifier from regular item entries. These entries are prefixed with the letter L, such as in the example entries for book and cow. Lexicographical entries in Wikidata can contain statements, senses, and forms. The use of lexicographical entries in Wikidata allows for the documentation of word usage, the connection between words and items on Wikidata, word translations, and enables machine-readable lexicographical data.[7]

Further reading


References

  1. Welcome to wikidata!Now what? Chalabi, Mona (26 April 2013).
  2. Wikidata Official Website Wikidata.
  3. Wikidata goes live worldwide The H. 25 April 2013.
  4. Wikidata live on the English Wikipedia Wikimedia Deutschland.Pintscher, Lydia (13 February 2013)
  5. Vrandečić, Denny; Pintscher, Lydia; Krötzsch, Markus (30 April 2023).Wikidata: The Making Of.Companion Proceedings of the ACM Web Conference 2023.
  6. Cochrane, Euan (30 September 2016). Wikidata as a digital preservation knowledgebase openpreservation.org.
  7. Wikidata query builder query.wikidata.org