Works on trustworthy machine learning, with a focus on uncertainty quantification and AI agent reliability evaluation.
How media typically covers Stephan Rabanser
Referenced in coverage
Despite rapid capability improvements in frontier AI models over 18 months, reliability has barely budged: agents remain inconsistent across runs and poorly calibrated about their own uncertainty, indicating that building capable AI is not the same as building dependable AI.