Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

Rubio visits Israel to discuss Iran, State Department announced

February 28, 2026

Dell stock soars faster than earnings as it overcomes memory shortage

February 28, 2026

Amazon’s $50 billion stake in OpenAI could boost AI, cloud business

February 28, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » New projects make Wikipedia data more accessible to AI
AI

New projects make Wikipedia data more accessible to AI

adminBy adminOctober 1, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


On Wednesday, Wikimedia Deutschland unveiled a new database that will make Wikipedia’s rich knowledge more accessible to AI models.

Called the Wikidata Embedding Project, the system applies a technique consisting of nearly 120 million entries to existing data on Wikipedia and its sister platforms, a technique that helps computers understand the meaning and relationships between words.

Combined with new support for Model Context Protocol (MCP), a standard that helps AI systems communicate with data sources, this project makes data more accessible to LLMS natural language queries.

The project was carried out by the German branch of Wikimedia in collaboration with neural search company Jina.ai and DataStax, a real-time training DATA company owned by IBM.

Wikidata has been providing machine-readable data from the Wikimedia property for many years, but existing tools now only allow keyword searches, SPARQL queries, and special query languages. The new system works well by providing developers with the opportunity to ground the model with knowledge verified by Wikipedia editors, thanks to a searched generation (RAG) system that allows AI models to draw in external information.

The data is configured to provide important semantic contexts. For example, queriing a database of the word “scientists” creates a list of scientists who worked at Bell Labs. There is also the translation of the word “scientist” into a different language, the image of scientists in the workplace that has cleared Wikimedia, and extrapolation to related concepts such as “researcher” and “scholar.”

The database is published on Toolforge. Wikidata is also holding a webinar for developers of interest on October 9th.

TechCrunch Events

San Francisco
|
October 27th-29th, 2025

This new project is because AI developers are rushing to a high-quality data source that they can use to fine-tune their models. The training system itself is more refined and often assembled as a complex training environment rather than a simple data set, but requires closely curated data to function. The need for reliable data is particularly urgent for deployments that require high accuracy, and some overlook Wikipedia, but that data is significantly more oriented than catch-all datasets like Common Crawl, a large collection of web pages scraped off the entire internet.

In some cases, driving high-quality data can have expensive consequences for AI labs. In August, humanity offered to settle a lawsuit with the group of authors whose works were being used as training material by agreeing to pay $1.5 billion to end allegations of fraud.

In a statement to the media, Wikidata AI Project Manager Philip Saade highlighted his project’s independence from major AI labs or large high-tech companies. “The launch of this embedded project shows that strong AI doesn’t need to be controlled by a small number of companies,” Saadé told reporters. “It could be open, supportive and built to serve everyone.”



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticleTrump Crypto companies plan to expand with tokenized products, debit cards
Next Article Lithium America stock pops as government takes stocks to boost the Nevada project
admin
  • Website

Related Posts

Musk criticized OpenAI in his deposition, saying, “No one committed suicide because of Grok.”

February 28, 2026

Department of Defense to designate humans as supply chain risk

February 28, 2026

Perplexity’s new computer is another bet where users will need a lot of AI models

February 27, 2026

AI music generator Suno reaches 2 million paid members and $300 million in annual recurring revenue

February 27, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

Shawn Johnson denies rumors that she is pregnant with fourth child

By adminFebruary 28, 20260

Sean Johnson responds to rumors that he is pregnant with his fourth childDon’t get it…

Lisa Rinna talks reaction to husband Harry Hamlin’s book, Rob Rausch, Traitor

February 28, 2026

Ruby Franke’s son Chad Franke’s burst appendix, surgery

February 28, 2026

Lil Jon’s son Nathan Smith’s cause of death revealed

February 27, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

President Trump wonders why Iran won’t “surrender.” There are many reasons

February 27, 2026

Chris Bagsarian: Police say grandfather was kidnapped from his bed and killed by mistaken identity

February 27, 2026

US embassy says non-essential staff can leave Israel amid potential Iranian attack

February 27, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.