Free Quiz
Write for Us
Learn Artificial Intelligence and Machine Learning
  • Artificial Intelligence
  • Data Science
    • Language R
    • Deep Learning
    • Tableau
  • Machine Learning
  • Python
  • Blockchain
  • Crypto
  • Big Data
  • NFT
  • Technology
  • Interview Questions
  • Others
    • News
    • Startups
    • Books
  • Artificial Intelligence
  • Data Science
    • Language R
    • Deep Learning
    • Tableau
  • Machine Learning
  • Python
  • Blockchain
  • Crypto
  • Big Data
  • NFT
  • Technology
  • Interview Questions
  • Others
    • News
    • Startups
    • Books
Learn Artificial Intelligence and Machine Learning
No Result
View All Result

Home » MIT researchers teach AI models to interpret charts

MIT researchers teach AI models to interpret charts

Tarun Khanna by Tarun Khanna
June 9, 2026
in Artificial Intelligence
Reading Time: 4 mins read
0
MIT researchers teach AI models to interpret charts

Image Credit: https://news.mit.edu/

Share on FacebookShare on TwitterShare on LinkedInShare on WhatsApp

The latest ChartNet training dataset ought to enhance the accuracy of vision-language models that assist examine business trends or clarify scientific figures.

To hasten and refine decision-making in a quick-paced, worldwide market, enterprises may deploy generative artificial intelligence models to support summarize and clarify the charts that frequently fill market summaries and financial reports.

But even the latest vision-language models sometimes struggle with this task, as it needs a model to clarify visual, numerical, and linguistic understanding. A company that invests in a state-of-the-art model would still acquire inaccurate or incomplete information.

Also Read:

OpenAI release new initiative to help discover and patch open source bugs

Near Absolute Zero, This Transistor Starts Acting Like a Brain Cell

Billionaire Ambani require AI in each call, app, and domestic

Federated Wireless Releases Spectrum AI for Shared Spectrum Networks

To fill this performance gap, researchers from MIT and the MIT-IBM Computing Research Lab developed a complex resource for AI users this is particularly designed to teach vision-language models (VLMs) how to efficiently interpret charts.

They used a novel data generation approach to construct a cutting-edge dataset that consists of more than a million varied charts. The dataset also encodes many visible, linguistic, and numerical components of each chart picture, which permit models to strongly reason about the information in a chart.

The researchers used this dataset, called ChartNet, to train a series of open-source VLMs. Many of those smaller models extensively outperformed orders of magnitude large, industrial models on tasks like data extraction and chart summarization.

By permitting open-source models to surpass their commercial counterparts, ChartNet ought to permit small firms with restricted budgets to more utilize AI. The open-source dataset can be used to improve the capabilities of AI models for tasks like business trend analysis and scientific figure interpretation.

“We evolved ChartNet to be a one-stop shop for chart understanding, overlaying particularly whatever that an AI model and a practitioner who is training that model would possibly want. We hope our work motivates researchers to attain state-of-the-art overall performance with smaller models that don’t need infinite amounts of computation,” stated Jovana Kondic, an MIT electric engineering and computer science (EECS) graduate student and lead author of a paper on ChartNet.

She is joined on the paper by many co-authors from MIT, the MIT-IBM Computing Research Lab, and IBM Research, together with Pengyuan Li, a research staff member at IBM Research; Dhiraj Joshi, a senior scientist at IBM Research; Isaac Sanchez, a software program engineer at IBM Research; Aude Oliva, director of strategic enterprise engagement on the MIT Schwarzman College of Computing, MIT director of the MIT-IBM Computing Research Lab, and a senior research scientist inside the Computer Science and Artificial Intelligence Laboratory (CSAIL); and Rogerio Feris, a principal scientist and manager on the MIT-IBM Computing Research Lab. The studies may be presented at IEEE Computer Vision and Pattern Recognition Conference.

A dataset bottleneck

Researchers have made great strides evolving generative AI models that excel at natural language processing and reasoning about natural images. But less work has targeted on decoding complicated multimodal data contained within charts, Kondic says.

Yet for large and small businesses in particularly every industry, chart understanding is a vital task.

“The finance industry prospers on charts. If vision-language models can extract data out of charts, like descriptions of trends, that allows a lot of workflows that happen downstream,” Joshi stated.

The lack of high-quality training data is a major bottleneck preserving back the development of VLMs that can appropriately interpret charts. Many datasets comprise limited chart images pulled from the internet and often lack the important scale and further information to support a model interpret the underlying data.

“A vision-language model, unlike our brains, may want to see thousands of examples throughout training to reliably understand something as a line chart,” Kondic says.

The researchers sought to surpass those shortcomings by generating synthetic data. Synthetic data are artificially formed via algorithms to mimic the statistical properties of real data.

The ChartNet dataset holds more a million high-quality chart images, along with the corresponding code used to generate each chart, a textual description, and a table that consists of its numerical information. In addition, every datapoint consists of question-and-answer pairs to teach the model how to efficaciously answer questions about the chart image.

“These additional modes of data guide the model to link and align the different pieces of information that the chart image encodes,” Kondic says.

Data generation

To build ChartNet, the researchers formed a two-step, synthetic data generation pipeline.

First, their automated system translates any pre-existing set of chart images into code. Then the system iteratively augments that code to change different aspects of every chart, consisting of chart type, data values, topic, colors, etc.

“We can begin from a single chart that we use as a seed and comes up hundreds of augmentations of it. This is how we have been capable of build a dataset with more than a million diverse images,” Kondic explains.

They also integrated an automatic quality chech process to make certain the synthetic data are high quality. This procedures verifies that the code is executable and rendered chart images are correct and clean.

“We don’t need to just be producing diverse samples. We also need the information to be displayed in a significant way,” she says.

ChartNet also includes a election of chart datapoints annotated by human professionals. This offers access to additional types of charts and helping data that carry validity guarantees.

A practitioner could use the annotated data to fine-tune an existing VLM, further expanding overall performance for a selected application, Joshi adds.

The researchers tested ChartNet by training IBM’s Granite Vision series of models in addition to numerous other open-source models of numerous sizes and comparing them on numerous chart interpretation tasks. The dataset enhanced the accuracy of all models in chart reconstruction, chart data extraction, chart summarization, and chart question answering.

With ChartNet, small open-source models constantly outperformed much larger commercial models.

“A lot of previous training datasets only targeted on answering easy questions about a chart. We tried to go beyond that with ChartNet by generating data that guide all components of sturdy chart understanding,” Kondic says.

In the future, the researchers plan to persist expanding ChartNet by incorporating data with added levels of complexity. They also need to draw on remarks from the research community.

This research was funded, in part, through the MIT-IBM Computing Research Lab.

ShareTweetShareSend
Previous Post

Trump AI Cybersecurity Order Targets Frontier Models and Critical Infrastructure

Next Post

If AI is addictive, where does the responsibility lie? With big tech or its users?

Tarun Khanna

Tarun Khanna

Founder DeepTech Bytes - Data Scientist | Author | IT Consultant
Tarun Khanna is a versatile and accomplished Data Scientist, with expertise in IT Consultancy as well as Specialization in Software Development and Digital Marketing Solutions.

Related Posts

AI data centers simply got a government-mandated rapid lane to the grid
Artificial Intelligence

AI data centers simply got a government-mandated rapid lane to the grid

June 19, 2026
Jensen Huang Calls for New Social Norms as AI Reshapes Society
Artificial Intelligence

Jensen Huang Calls for New Social Norms as AI Reshapes Society

June 19, 2026
The Future of Work Belongs to People Who Master AI
Artificial Intelligence

The Future of Work Belongs to People Who Master AI

June 17, 2026
Even GPT-5 Failed This Human Attention Test
Artificial Intelligence

Even GPT-5 Failed This Human Attention Test

June 17, 2026
Next Post
If AI is addictive, where does the responsibility lie? With big tech or its users?

If AI is addictive, where does the responsibility lie? With big tech or its users?

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

+ 3 = 11

TRENDING

What All Are The Best Open-Source Data Science Projects?

by Tarun Khanna
September 9, 2021
0
ShareTweetShareSend

US order cutting access to Anthropic’s AI models sparks criticism

US order cutting access to Anthropic’s AI models sparks criticism

Image Credit: https://techxplore.com/

by Tarun Khanna
June 15, 2026
0
ShareTweetShareSend

Trump postpones signing artificial intelligence order out of concern it would harm the AI industry

Trump postpones signing artificial intelligence order out of concern it would harm the AI industry

Image Credit: https://techxplore.com/

by Tarun Khanna
May 22, 2026
0
ShareTweetShareSend

Claude For Small Business Brings Agentic AI Into Everyday SMB Workflows

Claude For Small Business Brings Agentic AI Into Everyday SMB Workflows

Image Credit: https://opendatascience.com/

by Tarun Khanna
May 15, 2026
0
ShareTweetShareSend

China is increasingly keeping its best AI talent to itself

China is increasingly keeping its best AI talent to itself

Image Credit: https://techcrunch.com/

by Tarun Khanna
May 27, 2026
0
ShareTweetShareSend

Accelerating Machine Learning Model Deployment with MLOps Tools

machine learning
by Tarun Khanna
November 16, 2024
0
ShareTweetShareSend

DeepTech Bytes

Deep Tech Bytes is a global standard digital zine that brings multiple facets of deep technology including Artificial Intelligence (AI), Machine Learning (ML), Data Science, Blockchain, Robotics,Python, Big Data, Deep Learning and more.
Deep Tech Bytes on Google News

Quick Links

  • Home
  • Affiliate Programs
  • About Us
  • Write For Us
  • Submit Startup Story
  • Advertise With Us
  • Terms of Service
  • Disclaimer
  • Cookies Policy
  • Privacy Policy
  • DMCA
  • Contact Us

Topics

  • Artificial Intelligence
  • Data Science
  • Python
  • Machine Learning
  • Deep Learning
  • Big Data
  • Blockchain
  • Tableau
  • Cryptocurrency
  • NFT
  • Technology
  • News
  • Startups
  • Books
  • Interview Questions

Connect

For PR Agencies & Content Writers:

connect@deeptechbytes.com

Facebook Twitter Linkedin Instagram
Listen on Apple Podcasts
Listen on Google Podcasts
Listen on Google Podcasts
Listen on Google Podcasts
DMCA.com Protection Status

© 2024 Designed by AK Network Solutions

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Artificial Intelligence
  • Data Science
    • Language R
    • Deep Learning
    • Tableau
  • Machine Learning
  • Python
  • Blockchain
  • Crypto
  • Big Data
  • NFT
  • Technology
  • Interview Questions
  • Others
    • News
    • Startups
    • Books

© 2023. Designed by AK Network Solutions