Evaluating LLMs as risk scores

Survey evaluation
Accuracy and calibration of LLMs on human prediction tasks

Publications

Cruz, A. F., Hardt, M., Mendler-Dünner, C. Evaluating Language Models as Risk Scores. Conference paper, Advances in Neural Information Processing Systems 37 (NeurIPS 2024), The Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS), December 2024 (Published). Social Foundations of Computation, Algorithms and Society.