Free Quiz
Write for Us
Learn Artificial Intelligence and Machine Learning
  • Artificial Intelligence
  • Data Science
    • Language R
    • Deep Learning
    • Tableau
  • Machine Learning
  • Python
  • Blockchain
  • Crypto
  • Big Data
  • NFT
  • Technology
  • Interview Questions
  • Others
    • News
    • Startups
    • Books
  • Artificial Intelligence
  • Data Science
    • Language R
    • Deep Learning
    • Tableau
  • Machine Learning
  • Python
  • Blockchain
  • Crypto
  • Big Data
  • NFT
  • Technology
  • Interview Questions
  • Others
    • News
    • Startups
    • Books
Learn Artificial Intelligence and Machine Learning
No Result
View All Result

Home » Teaching AI models what they don’t know

Teaching AI models what they don’t know

Tarun Khanna by Tarun Khanna
June 5, 2025
in Machine Learning
Reading Time: 4 mins read
0
Teaching AI models what they don’t know

Photo Credit:https://news.mit.edu/

Share on FacebookShare on TwitterShare on LinkedInShare on WhatsApp

A team of MIT researchers founded Themis AI to quantify AI model uncertainty and cope with knowledge gaps.

Artificial intelligence structures like ChatGPT offers plausible-sounding solutions to any query you would possibly ask. But they don’t usually reveal the gaps in their knowledge or regions where they’re unsure. That problem can have large outcomes as AI systems are increasingly more used to do things like develop drugs, synthesize records, and drive independent vehicles.

Now, the MIT spinout Themis AI is supporting quantify models uncertainty and correct outputs before they cause bigger problems. The enterprise’s Capsa platform can work with any machine-learning model to hit upon and correct unreliable outputs in seconds. It works via modifying AI models to access them to detect patterns of their statistics processing that imply ambiguity, incompleteness, or bias.

Also Read:

Making AI-generated code more correct in any language

The Rise of AI: Leading Computer Scientists anticipate a Star Trek-Like Future

Researchers teach LLMs to solve complex planning challenges

Like human brains, large language models reason about diverse data in a standard way

“The concept is to take a model, wrap it in Capsa, identify the uncertainties and failure modes of the model, and then improve the model ion,” stated Themis AI co-founder and MIT Professor Daniela Rus, who is also the director of the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). “We’re excited about supplying an solution that could enhance model and provide guarantees that the model is working correctly.”

Rus founded Themis AI in 2021 with Alexander Amini ’17, SM ’18, PhD ’22 and Elaheh Ahmadi ’20, MEng ’21, two former research affiliates in her lab. Since then, they’ve assisted telecom corporations with network making plans and automation, helped oil and gas corporations use AI to understand seismic imagery, and published papers on growing more dependable and trustworthy chatbots.

“We want to access AI in the highest-stakes applications of every industry,” Amini stated. “We’ve all seen examples of AI hallucinating or making mistakes. As AI is positioned more widely, those mistakes may want to cause devastating consequences. Themis makes it possible that any AI can forecast and expect its very own failures, before they happen.”

Helping models know what they don’t know

Rus’ lab has been researching model uncertainty for years. In 2018, she acquired funding from Toyota to study the reliability of a machine learning-based autonomous self reliant solution.

“That is a safety-essential context where understanding model reliability is very critical,” Rus stated.

In separate work, Rus, Amini, and their collaborators built an algorithm that could detect racial and gender bias in facial recognition structures and automatically reweight the model’s training statistics, displaying it removed bias. The algorithm worked by figuring out the unrepresentative parts of the underlying training statistics and generating new, similar data samples to rebalance it.

In 2021, the eventual co-founders showed a similar approach can be used to assist pharmaceutical enterprises use AI models to are expecting the properties of drug applicants. They founded Themis AI later that year.

“Guiding drug discovery may potentially save a lot of money,” Rus stated. “That was the use case that made us realize how powerful this tool will be.”

Today Themis AI is working with corporations in numerous industries, and lots of those corporations are constructing building language model. By the using of Capsa, these models are able to quantify their own uncertainty for each output.

“Many corporations are interested about the usage of LLMs which can be primarily based on their statistics, however they’re concerned about reliability,” observes Stewart Jamieson SM ’20, PhD ’24, Themis AI’s head of technology. “We assist LLMs self-report their self assurance and uncertainty, which enables more reliable question answering and flagging unreliable outputs.”

Themis AI is also in discussions with semiconductor corporations building AI solutions on their chips that can work outside of cloud environments.

“Normally those smaller models that work on phones or embedded systems aren’t very accurate compared to what you can run on a server, however we are able to get the best of each worlds: low latency, efficient edge computing without sacrificing quality,” Jamieson explains. “We see a future where edge devices do most of the work, but however whenever they’re unsure of their output, they are able to forward those tasks to a primary server.”

Pharmaceutical corporations also can use Capsa to enhance AI models getting used to identify drug candidates and expect their performance in clinical trials.

“The predictions and outputs of these models are very complex and hard to interpret — professionals spend quite a long time and effort seeking to make sense of them,” Amini remarks. “Capsa can give insights right out of the gate to understand if the predictions are backed by way of evidence in the training set or are just speculation without numerous grounding. That can boost up the identification of the strongest predictions, and we suppose that has a huge ability for societal good.”

Research for effect

Themis AI’s team believes the corporation is properly-positioned to enhance the edge of constantly evolving AI technology. For example, the corporation is exploring Capsa’s ability to improve accuracy in an AI technique referred to as chain-of-thought reasoning, in which LLMs explain the steps they take to get to an solution.

“We’ve seen signs Capsa should assist guide those reasoning strategies to identify the highest confidence chains of reasoning,” Jamieson stated. “We suppose that has large implications in phrases of enhancing the LLM experience, lowering latencies, and lowering computation necessities. It’s an extremely high-effect opportunity for us.”

For Rus, who has co-founded numerous corporations since coming to MIT, Themis AI is an opportunity to make sure her MIT research has effect.

“My students and I have become increasingly obsessed on going the extra step to make our work relevant for the world,” Rus says. “AI has fantastic ability to transform industries, however AI also increases issues. What excites me is the opportunity to assist expand technical solutions that cope with those challenges and also build trust with and understanding among people and the technologies which are turning into a part of their day by day lives.”

ShareTweetShareSend
Previous Post

Tackling hallucinations: MIT spinout teaches AI to confess when it’s clueless

Next Post

Anthropic launches Claude AI models for US national safety

Tarun Khanna

Tarun Khanna

Founder DeepTech Bytes - Data Scientist | Author | IT Consultant
Tarun Khanna is a versatile and accomplished Data Scientist, with expertise in IT Consultancy as well as Specialization in Software Development and Digital Marketing Solutions.

Related Posts

machine learning
Technology

Accelerating Machine Learning Model Deployment with MLOps Tools

November 16, 2024
Machine Learning Prediction Examples
Machine Learning

Machine Learning Prediction Examples

January 22, 2023
Top 10 Real World Applications of Machine Learning
Machine Learning

Top 10 Real World Applications of Machine Learning

January 20, 2023
Machine-Learning-Role-In-Paraphrasing-Tool
Machine Learning

Machine Learning Role In Paraphrasing Tools To Avoid Plagiarism

June 9, 2022
Next Post
Anthropic launches Claude AI models for US national safety

Anthropic launches Claude AI models for US national safety

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

− 6 = 2

TRENDING

How SSL Encryption Secures Big Data In Cloud Computing?

by Tarun Khanna
April 14, 2022
0
ShareTweetShareSend

Best Free Datasets Resources To Help You In Your Data Science Projects

best-free-datasets
by Tarun Khanna
September 7, 2021
0
ShareTweetShareSend

Working Of Machine Learning In AI Paraphrasing Tools

AI Paraphrasing Tools
by Tarun Khanna
March 31, 2022
0
ShareTweetShareSend

World First: Engineers Train AI at Lightspeed

World First: Engineers Train AI at Lightspeed

Photo Credit: https://scitechdaily.com/ Penn Engineers have created the first programmable photonic chip that can train nonlinear neural networks using light, potentially revolutionizing AI by making it faster and more energy-efficient. Unlike traditional electronic chips, this new chip reshapes light itself to perform complex computations, enabling real-time learning and offering a major step toward fully light-powered computers.

by Tarun Khanna
May 2, 2025
0
ShareTweetShareSend

Data Science Strategy of Oracle

oracle-data-science
by Tarun Khanna
May 10, 2021
0
ShareTweetShareSend

Top 10 Esteemed Big Data Analytics Companies

data-analytics
by Tarun Khanna
May 14, 2021
0
ShareTweetShareSend

DeepTech Bytes

Deep Tech Bytes is a global standard digital zine that brings multiple facets of deep technology including Artificial Intelligence (AI), Machine Learning (ML), Data Science, Blockchain, Robotics,Python, Big Data, Deep Learning and more.
Deep Tech Bytes on Google News

Quick Links

  • Home
  • Affiliate Programs
  • About Us
  • Write For Us
  • Submit Startup Story
  • Advertise With Us
  • Terms of Service
  • Disclaimer
  • Cookies Policy
  • Privacy Policy
  • DMCA
  • Contact Us

Topics

  • Artificial Intelligence
  • Data Science
  • Python
  • Machine Learning
  • Deep Learning
  • Big Data
  • Blockchain
  • Tableau
  • Cryptocurrency
  • NFT
  • Technology
  • News
  • Startups
  • Books
  • Interview Questions

Connect

For PR Agencies & Content Writers:

connect@deeptechbytes.com

Facebook Twitter Linkedin Instagram
Listen on Apple Podcasts
Listen on Google Podcasts
Listen on Google Podcasts
Listen on Google Podcasts
DMCA.com Protection Status

© 2024 Designed by AK Network Solutions

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Artificial Intelligence
  • Data Science
    • Language R
    • Deep Learning
    • Tableau
  • Machine Learning
  • Python
  • Blockchain
  • Crypto
  • Big Data
  • NFT
  • Technology
  • Interview Questions
  • Others
    • News
    • Startups
    • Books

© 2023. Designed by AK Network Solutions