Robustness & Evaluation
Stress-testing models and the benchmarks we judge them by, to surface where they really break.
UC Irvine · Natural Language Processing
We are the Natural Language Processing group at UC Irvine. We study the robustness, interpretability, and reliability of machine learning and NLP — from benchmarks and explanations to neuro-symbolic reasoning and controllable generation.
Stress-testing models and the benchmarks we judge them by, to surface where they really break.
Explaining what models do — and exposing the limits and manipulability of post-hoc explanations.
Bringing logical structure, constraints, and steering to neural models and LLMs.