It can be difficult to make a generative AI model understand a spreadsheet. In order to try to solve this problem, Microsoft researchers published a paper on July 12 on Arxiv describing SpreadsheetLLM, an encoding framework to enable large language models to “read” spreadsheets.
SpreadsheetLLM could “transform spreadsheet data management and analysis, paving the way for more intelligent and efficient user interactions,” the researchers wrote.
One advantage of SpreadsheetLLM for business would be to use formulas in spreadsheets without learning how to use them by asking questions of the AI model in natural language.
Spreadsheets are a challenge for LLMs for several reasons.
There are two main parts of SpreadsheetLLM:
SheetCompressor has three modules:
Using these modules, the team reduced the tokens needed for spreadsheet encoding by 96%. This, in turn, enabled a slight (12.3%) improvement over another leading research team’s work into helping LLMs understand spreadsheets. The researchers tried their spreadsheet identification method with these LLMs:
For the Chain of Spreadsheet capabilities, they used GPT-4.
The obvious advantage for Microsoft here is in enabling its AI assistant Copilot, which works in many Microsoft 365 suite applications, to do more in Excel. SpreadsheetLLM represents the ongoing effort to make generative AI practical – and opening up Excel to people who haven’t been trained on its more advanced features might be a good niche for generative AI to expand into.
SEE: How deeply your business engages with Microsoft Copilot will affect which – if any – version is right for your work.
A 12.3% improvement over a previous, leading research team’s findings is more academically significant than economically significant for now. Generative AI is infamous for making things up, and hallucinations cascading through a spreadsheet could render huge swaths of data useless. As the researchers point out, getting an LLM to understand a spreadsheet’s format – that is, what a spreadsheet usually looks like and how it functions – is different from getting the LLM to generate comprehensible, accurate data inside those cells.
In addition, this methodology takes a lot of computing power and multiple passes through a LLM to generate an answer. Plus, your office’s Excel wizard might be able to pull an answer in a few minutes without using nearly as much energy.
Going forward, the research team wants to include a way to encode details like the background color of cells and to deepen the LLMs’ understanding of how words within the cells relate to one another.
TechRepublic has reached out to Microsoft for more information.