BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates

Guide Labs Debuts New Kind of Interpretable LLM

By admin, February 23, 2026


The challenge in working with a deep learning model is often understanding why it behaves the way it does. Whether it’s xAI’s repeated struggle sessions to fine-tune Grok’s odd politics or ChatGPT’s struggles with sycophancy and run-of-the-mill hallucinations, probing a neural network with billions of parameters isn’t easy.

Guide Labs, a San Francisco startup founded by CEO Julius Adebayo and chief scientific officer Aya Abdelsalam Ismail, is offering an answer to that problem today. On Monday, the company open sourced Steerling-8B, an 8 billion parameter LLM trained with a new architecture designed to make its actions easier to interpret. Every token the model generates can be traced back to its origins in the LLM’s training data.

That can be as simple as determining which reference materials the model drew on for a fact it cites, or as complex as understanding how the model represents humor or gender.

“If there are a trillion ways to encode gender, and I encode it in a billion of the trillion things that I have, I have to make sure that I can find all the billion things that I encoded. And I have to be able to reliably turn it on and turn it off,” Adebayo told TechCrunch. “You can do that with the current model, but it’s very fragile…It’s kind of one of those holy grail questions.”

Adebayo began this research while completing his PhD at MIT, where he co-authored a widely cited 2020 paper showing that existing methods of understanding deep learning models are unreliable. That work ultimately led to a new way of building LLMs. Developers insert a concept layer into the model that sorts data into categories that can be traced. This requires more up-front data annotation, but by using other AI models to help, the company was able to train this model as its largest proof of concept to date.
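To make the idea of a concept layer more concrete, here is a minimal sketch of a generic "concept bottleneck": raw features are first mapped to a small set of named concept activations, and the final score is computed only from those activations. This is not Guide Labs' architecture, which the article does not describe in detail. The concept names, weights, and function names below are hypothetical and chosen purely for illustration.

```python
from math import exp

# Hypothetical toy weights: each named concept reads two raw features.
CONCEPT_WEIGHTS = {
    "income": [0.8, -0.1],
    "credit_history": [0.2, 0.9],
    "humor": [-0.5, 0.3],
}

# Hypothetical weights from each concept to the final score.
OUTPUT_WEIGHTS = {"income": 1.5, "credit_history": 2.0, "humor": 0.1}


def sigmoid(x):
    return 1.0 / (1.0 + exp(-x))


def concept_layer(features, concept_weights):
    """Map raw features to one named activation in (0, 1) per concept."""
    return {
        name: sigmoid(sum(w * f for w, f in zip(weights, features)))
        for name, weights in concept_weights.items()
    }


def predict(concepts, output_weights, disabled=()):
    """Return (score, per-concept contributions).

    The score is a weighted sum of concept activations, so each
    concept's share of the output can be read off directly, and a
    concept listed in `disabled` contributes exactly zero.
    """
    contributions = {
        name: 0.0 if name in disabled else output_weights[name] * act
        for name, act in concepts.items()
    }
    return sum(contributions.values()), contributions


if __name__ == "__main__":
    acts = concept_layer([1.0, 2.0], CONCEPT_WEIGHTS)
    score, contrib = predict(acts, OUTPUT_WEIGHTS)
    print("score:", score)
    print("contributions:", contrib)
    # Switch one concept off and see how the score changes.
    score_off, _ = predict(acts, OUTPUT_WEIGHTS, disabled={"humor"})
    print("score without 'humor':", score_off)
```

Because the output depends only on the named activations, tracing a prediction back to a concept, or turning that concept off, is a dictionary lookup rather than a search through billions of parameters. A real model would learn these weights and operate at a vastly larger scale.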

“The kind of interpretability that people do… is model-based neuroscience, and we turn it on its head,” Adebayo said. “What we’re really doing is designing a model from scratch so that we don’t have to do any neuroscience.”


One concern with this approach is that it may eliminate some of the emergent behaviors that make LLMs so interesting: their ability to generalize in new ways about things they have not been trained on. Adebayo says that still happens in his company’s model. His team tracks what it calls “discovered concepts,” which the model discovered on its own, such as quantum computing.

Adebayo argues that this interpretable architecture is something everyone will need. For consumer LLMs, model builders could use these techniques to block the use of copyrighted material and to better control output on subjects such as violence and substance abuse. Regulated industries will require more controllable LLMs. In finance, for example, a model that evaluates loan applicants should consider factors like financial records rather than race. Interpretability is also needed in scientific research, another area where Guide Labs has developed technology. Protein folding has been a major success for deep learning models, but scientists need more insight into why the software arrives at the combinations that work.

“This model shows that training interpretable models is no longer a kind of science, but an engineering problem,” Adebayo said. “We can figure out the science and extend it. There’s no reason why these kinds of models can’t match the performance of frontier-level models with more parameters.”

According to Guide Labs, Steerling-8B achieves 90% of the capability of existing models while using less training data, thanks to its new architecture. The company emerged from Y Combinator and raised a $9 million seed round from Initialized Capital in November 2024. Its next steps are to build a larger model and begin offering API and agentic access to users.

“The way we currently train models is so primitive that democratizing inherent interpretability will actually be good for us as a species in the long run,” Adebayo told TechCrunch. “We’re chasing these models that are going to be super intelligent, so you don’t want something mysterious to you making decisions for you.”


