MIT researchers teach AI models to interpret charts

by Tarun Khanna
June 9, 2026
in Artificial Intelligence
Reading Time: 4 mins read

Image Credit: https://news.mit.edu/

The new ChartNet training dataset could improve the accuracy of vision-language models that help analyze business trends or explain scientific figures.

To speed up and refine decision-making in a fast-paced global market, enterprises may deploy generative artificial intelligence models to help summarize and explain the charts that frequently fill market summaries and financial reports.

But even the latest vision-language models sometimes struggle with this task, because it requires a model to combine visual, numerical, and linguistic understanding. A company that invests in a state-of-the-art model could still receive inaccurate or incomplete information.

To close this performance gap, researchers from MIT and the MIT-IBM Computing Research Lab developed a comprehensive resource for AI users that is specifically designed to teach vision-language models (VLMs) how to interpret charts effectively.

They used a novel data generation approach to build a dataset of more than a million varied charts. The dataset also encodes many visual, linguistic, and numerical components of each chart image, which lets models reason robustly about the information in a chart.

The researchers used this dataset, called ChartNet, to train a series of open-source VLMs. Many of those smaller models significantly outperformed commercial models that are orders of magnitude larger on tasks like data extraction and chart summarization.

By enabling open-source models to surpass their commercial counterparts, ChartNet could allow small firms with limited budgets to make better use of AI. The open-source dataset can be used to improve the capabilities of AI models for tasks like business trend analysis and scientific figure interpretation.

“We developed ChartNet to be a one-stop shop for chart understanding, covering practically everything that an AI model, and a practitioner who is training that model, might need. We hope our work motivates researchers to achieve state-of-the-art performance with smaller models that don’t need infinite amounts of computation,” says Jovana Kondic, an MIT electrical engineering and computer science (EECS) graduate student and lead author of a paper on ChartNet.

She is joined on the paper by many co-authors from MIT, the MIT-IBM Computing Research Lab, and IBM Research, including Pengyuan Li, a research staff member at IBM Research; Dhiraj Joshi, a senior scientist at IBM Research; Isaac Sanchez, a software engineer at IBM Research; Aude Oliva, director of strategic enterprise engagement at the MIT Schwarzman College of Computing, MIT director of the MIT-IBM Computing Research Lab, and a senior research scientist in the Computer Science and Artificial Intelligence Laboratory (CSAIL); and Rogerio Feris, a principal scientist and manager at the MIT-IBM Computing Research Lab. The research will be presented at the IEEE Computer Vision and Pattern Recognition Conference.

A dataset bottleneck

Researchers have made great strides in developing generative AI models that excel at natural language processing and reasoning about natural images. But less work has focused on decoding the complex multimodal data contained within charts, Kondic says.

Yet for large and small businesses in practically every industry, chart understanding is a vital task.

“The finance industry thrives on charts. If vision-language models can extract information from charts, like descriptions of trends, that enables a lot of workflows that happen downstream,” Joshi says.

The lack of high-quality training data is a major bottleneck holding back the development of VLMs that can accurately interpret charts. Many datasets contain a limited number of chart images pulled from the internet, and they often lack the scale and additional information needed to help a model interpret the underlying data.

“A vision-language model, unlike our brains, may need to see thousands of examples during training to reliably understand something like a line chart,” Kondic says.

The researchers sought to overcome those shortcomings by generating synthetic data. Synthetic data are artificially generated by algorithms to mimic the statistical properties of real data.

The ChartNet dataset contains more than a million high-quality chart images, along with the corresponding code used to generate each chart, a textual description, and a table that contains its numerical data. In addition, every datapoint includes question-and-answer pairs to teach the model how to correctly answer questions about the chart image.

“These additional modes of data help the model link and align the different pieces of information that the chart image encodes,” Kondic says.
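To make the shape of one such datapoint concrete, here is a minimal Python sketch. The class and field names (`ChartDatapoint`, `plot_code`, `qa_pairs`, and so on) and the example values are hypothetical, chosen only to mirror the components the article lists; they are not ChartNet's actual schema.

```python
from dataclasses import dataclass, field


@dataclass
class QAPair:
    question: str
    answer: str


@dataclass
class ChartDatapoint:
    # Hypothetical field names; the real ChartNet schema may differ.
    image_path: str    # rendered chart image
    plot_code: str     # code used to generate the chart
    description: str   # textual description of the chart
    table: list[dict]  # underlying numerical data, one dict per row
    qa_pairs: list[QAPair] = field(default_factory=list)


# Illustrative values only, not taken from the dataset.
example = ChartDatapoint(
    image_path="charts/revenue.png",
    plot_code="plt.bar(['Q1', 'Q2'], [10.0, 12.5])",
    description="Quarterly revenue rises from Q1 to Q2.",
    table=[
        {"quarter": "Q1", "revenue": 10.0},
        {"quarter": "Q2", "revenue": 12.5},
    ],
    qa_pairs=[QAPair("Which quarter has higher revenue?", "Q2")],
)
```

Keeping the image, code, description, table, and questions in one record is what lets a training pipeline pair any two of them, for example image to table for data extraction or image to description for summarization.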

Data generation

To build ChartNet, the researchers created a two-step synthetic data generation pipeline.

First, their automated system translates any pre-existing set of chart images into code. Then the system iteratively augments that code to change different aspects of every chart, such as chart type, data values, topic, and colors.

“We can start from a single chart that we use as a seed and come up with hundreds of augmentations of it. This is how we were able to build a dataset with more than a million diverse images,” Kondic explains.
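The seed-and-augment idea can be illustrated with a toy Python sketch. The article does not describe how the researchers' system rewrites chart code, so this example makes a simplifying assumption: it perturbs a small chart specification (a dictionary) rather than real plotting code. The `augment` function, the spec keys, and the option lists are all hypothetical.

```python
import copy
import random

# Hypothetical option lists; a real pipeline would cover far more variety.
CHART_TYPES = ["line", "bar", "scatter"]
COLORS = ["tab:blue", "tab:orange", "tab:green"]


def augment(seed_spec: dict, n: int, rng: random.Random) -> list[dict]:
    """Return n variants of seed_spec with a new type, color, and values."""
    variants = []
    for _ in range(n):
        spec = copy.deepcopy(seed_spec)  # never mutate the seed itself
        spec["chart_type"] = rng.choice(CHART_TYPES)
        spec["color"] = rng.choice(COLORS)
        # Scale each seed value by a random factor between 0.5 and 1.5.
        spec["values"] = [
            round(v * rng.uniform(0.5, 1.5), 2) for v in seed_spec["values"]
        ]
        variants.append(spec)
    return variants


seed = {
    "chart_type": "line",
    "color": "tab:blue",
    "labels": ["Q1", "Q2", "Q3"],
    "values": [10.0, 12.5, 9.0],
}
variants = augment(seed, n=100, rng=random.Random(0))
```

Passing in a seeded `random.Random` keeps the run reproducible, which matters when a generated dataset has to be rebuilt or audited later.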

They also integrated an automatic quality check process to ensure the synthetic data are high quality. This procedure verifies that the code is executable and that the rendered chart images are correct and clean.
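The executability half of such a check might look like the following Python sketch. It is an assumption-laden illustration, not the researchers' implementation: `is_executable` and `filter_valid` are hypothetical helpers, and the sketch does not attempt the second half of the check (judging whether the rendered image is correct and clean).

```python
def is_executable(code: str) -> bool:
    """Return True if the snippet compiles and runs without raising."""
    try:
        # Run in a fresh namespace so snippets cannot see each other's state.
        exec(compile(code, "<chart-code>", "exec"), {})
    except Exception:
        return False
    return True


def filter_valid(snippets: list[str]) -> list[str]:
    """Keep only the snippets that execute cleanly."""
    return [s for s in snippets if is_executable(s)]


candidates = [
    "values = [1, 2, 3]\ntotal = sum(values)",  # runs fine
    "values = [1, 2",                           # syntax error
    "ratio = 1 / 0",                            # runtime error
]
valid = filter_valid(candidates)
```

Calling `exec` on generated code is unsafe outside a sandbox, so a production pipeline would run each snippet in an isolated process or container with a timeout.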

“We don’t want to just be producing diverse samples. We also want the information to be displayed in a meaningful way,” she says.

ChartNet also includes a selection of chart datapoints annotated by human experts. This offers access to additional types of charts and supporting data that carry validity guarantees.

A practitioner could use the annotated data to fine-tune an existing VLM, further improving performance for a specific application, Joshi adds.

The researchers tested ChartNet by training IBM’s Granite Vision series of models, as well as several other open-source models of various sizes, and evaluating them on a range of chart interpretation tasks. The dataset improved the accuracy of all models in chart reconstruction, chart data extraction, chart summarization, and chart question answering.

With ChartNet, small open-source models consistently outperformed much larger commercial models.

“A lot of previous training datasets only focused on answering simple questions about a chart. We tried to go beyond that with ChartNet by generating data that support all components of robust chart understanding,” Kondic says.

In the future, the researchers plan to continue expanding ChartNet by incorporating data with added levels of complexity. They also want to draw on feedback from the research community.

This research was funded, in part, through the MIT-IBM Computing Research Lab.
