Revision as of 22:41, 20 December 2023
Semantic MediaWiki is one of the largest and most complex extensions to MediaWiki, and also an indispensable one for enterprise use. The features it provides are partly described on the Metadata page.
One major advancement was that Bernhard Krabina established ties with Open Collective so that individuals and organizations can donate money to the project.
Task tracking
HalloWelt! combined four extensions they created to provide useful task tracking in (Semantic) MediaWiki.
Extension:SimpleTasks: tasks are checklist items that can be checked or unchecked to indicate whether the task is completed or still open.
Yaron Koren gave a great presentation (slides) called Enhanced Wikibase on the features that Wikibase (and therefore Wikidata) is missing. He showed how he implemented these missing features in a series of developments. One is showcased at Wikidata Walkabout, a drill-down and query interface to Wikibase sites powered by Anvesha, a JavaScript library.
Natural Language Queries to Wikidata: A Naive Prototype
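Robert Timms, Sr. Software Engineer Wikibase Suite at Wikimedia Deutschland, gave [https://www.semantic-mediawiki.org/wiki/SMWCon_Fall_2023/Natural_Language_Queries_to_Wikidata:_A_Na%C3%AFve_Prototype a talk] ([https://github.com/rti/askwikidata code], [https://docs.google.com/presentation/d/1YgDmcvoXaqnYdRyX5RxewVkeioEJ92nb8Sfb_halBsM slides], [https://colab.research.google.com/drive/1yRZshpNj0kXwY0XuUYw5ziqjw_RffxH- try it]) about querying Wikibase with an LLM. Slides 9-22 go from the application architecture to the 'tada' moment.
[Figure: Application architecture]
Although it was not the goal of the talk, he revealed some of the key drawbacks of using "AI" in the first place:
Outdated information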
Prone to hallucinations
No sources (AI doesn't tell you how or why it claims to be authoritative.)
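This is supposed to be addressed in part by retrieval-augmented generation (RAG), a technique that first retrieves relevant source material and then supplies it to the model alongside the question.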
The 'GPT' in ChatGPT stands for "Generative Pre-trained Transformer", or a fancy way to say "guess". Large language model GPTs guess what you would say next based on the prompt given and the dataset they were trained on. In OpenAI's own words: "Generative AI models formulate responses by matching patterns or words, while RAG systems retrieve data based on similarity of meaning or semantic searches."
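To make the retrieval step of RAG more concrete, here is a minimal sketch in Python. It is not the code from the talk: the three example documents, the function names, and the bag-of-words "embedding" are all assumptions chosen for illustration. A real system would use a learned embedding model and a much larger index of statements. The sketch only shows the general idea: rank stored texts by similarity to the question, then put the best matches into the prompt so the model has sources to answer from.

```python
import math
import re
from collections import Counter

# Hypothetical mini knowledge base (illustration only, not real Wikidata output).
DOCUMENTS = [
    "Berlin is the capital of Germany.",
    "Paris is the capital of France.",
    "Semantic MediaWiki is an extension to MediaWiki.",
]


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(question, documents, k=1):
    """Return the k documents most similar to the question."""
    query = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(query, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(question, documents, k=1):
    """Assemble a prompt that places the retrieved sources before the question."""
    sources = "\n".join(f"- {doc}" for doc in retrieve(question, documents, k))
    return (
        "Answer using only these sources:\n"
        f"{sources}\n\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    print(build_prompt("What is the capital of Germany?", DOCUMENTS))
```

Because the retrieved text travels with the question, the answer can point back to its sources, and updating the stored documents updates what the model sees without retraining it. The generation step itself (sending the prompt to an LLM) is left out here on purpose.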