
Get classification risk scores on tabular tasks using LLMs.
Folktexts is a python package to evaluate statistical properties of LLMs as classifiers. It enables computing and evaluating classification risk scores for tabular prediction tasks using LLMs.
Several benchmark tasks are provided based on data from the American Community Survey. Namely, each prediction task from the popular folktables package is made available as a natural-language prompting task.
Release Date: | 19 July 2024 |
licence_type: | The MIT License |
Repository: | https://github.com/socialfoundations/folktexts |