Skip to content(if available)orjump to list(if available)

TaxCalcBench: Evaluating Frontier Models on the Tax Calculation Task

ofrzeta

"Calculating US personal income taxes is a task that requires building an understanding of vast amounts of English text and using that knowledge to carefully compute results. ... Our experiment shows that state-of-the-art models succeed in calculating less than a third of federal income tax returns even on this simplified sample set."

Unsurprisingly. Sometimes I feel like I am in a madhouse. Or in an alchemist's laboratory.

anticensor

Whereas almost every other country tries to make it easier to file taxes, even when the underlying tax schedule is complex.