Microsoft Researchers Are Teaching AI to Read Spreadsheets

Tech News

brian

July 19, 2024

Microsoft Researchers Are Teaching AI to Read Spreadsheets

Why are spreadsheets a challenge for LLMs?

Spreadsheets are a challenge for LLMs for several reasons.

Spreadsheets can be very large, exceeding the number of characters a LLM can digest at one time.
Spreadsheets are “two-dimensional layouts and structures,” as the report puts it, as opposed to the “linear and sequential input” LLMs work well with.
LLMs aren’t usually trained to interpret cell addresses and specific spreadsheet formats.

Microsoft researchers used multiple-step technique to parse spreadsheets

There are two main parts of SpreadsheetLLM:

SheetCompressor, which is a framework to shrink spreadsheets down into formats LLMs can understand.
Chain of Spreadsheet, which is a methodology for teaching a LLM how to identify the right parts of a compressed spreadsheet to “look at” when presented with a question and for generating a response.

A diagram of how the SpreadsheetLLM framework “reads” a spreadsheet by performing multiple processes. Image: Microsoft

SheetCompressor has three modules:

Structural anchors that help LLMs identify the rows and columns in the spreadsheet.
A method for reducing the number of tokens it costs for the LLM to interpret the spreadsheet.
A technique for improving efficiency by clustering similar cells together.

Using these modules, the team reduced the tokens needed for spreadsheet encoding by 96%. This, in turn, enabled a slight (12.3%) improvement over another leading research team’s work into helping LLMs understand spreadsheets. The researchers tried their spreadsheet identification method with these LLMs:

OpenAI’s GPT-4 and GPT-3.5.
Meta’s Llama 2 and Llama 3.
Microsoft’s Phi-3.
Mistral AI’s Mistral-v2.

For the Chain of Spreadsheet capabilities, they used GPT-4.

What does SpreadsheetLLM mean for Microsoft’s AI efforts?

The obvious advantage for Microsoft here is in enabling its AI assistant Copilot, which works in many Microsoft 365 suite applications, to do more in Excel. SpreadsheetLLM represents the ongoing effort to make generative AI practical – and opening up Excel to people who haven’t been trained on its more advanced features might be a good niche for generative AI to expand into.

SEE: How deeply your business engages with Microsoft Copilot will affect which – if any – version is right for your work.

Real-world usage and next steps for this Microsoft research

A 12.3% improvement over a previous, leading research team’s findings is more academically significant than economically significant for now. Generative AI is infamous for making things up, and hallucinations cascading through a spreadsheet could render huge swaths of data useless. As the researchers point out, getting an LLM to understand a spreadsheet’s format – that is, what a spreadsheet usually looks like and how it functions – is different from getting the LLM to generate comprehensible, accurate data inside those cells.

In addition, this methodology takes a lot of computing power and multiple passes through a LLM to generate an answer. Plus, your office’s Excel wizard might be able to pull an answer in a few minutes without using nearly as much energy.

Going forward, the research team wants to include a way to encode details like the background color of cells and to deepen the LLMs’ understanding of how words within the cells relate to one another.

TechRepublic has reached out to Microsoft for more information.

Source link

CrowdStrike Outage Disrupts Microsoft Systems Worldwide

Achieve Steady SEO Growth with this Bundle for Just $30

Microsoft Researchers Are Teaching AI to Read Spreadsheets

Why are spreadsheets a challenge for LLMs?

Microsoft researchers used multiple-step technique to parse spreadsheets

What does SpreadsheetLLM mean for Microsoft’s AI efforts?

Real-world usage and next steps for this Microsoft research

brian

Categories