About me

I am currently a Postdoctoral Researcher in Prof. Dawn Song’s group at UC Berkeley. Before that, I earned my Ph.D. in Computer Sciences from the University of Wisconsin-Madison, advised by Prof. Sharon (Yixuan) Li. My PhD research aims to pave the way to a reliable Open-world Machine Learning system, covering topics: Out-of-distribution (OOD) Detection, Open-world Representation Learning (ORL), Interpretability.

My current research revolves around the trustworthy Large Language Models (LLMs) on the following topics:

Hallucination, Mechanistic Interpretability, LLM Agent

(Latest update: 02/27/2025)

News

[2/26/2025] One paper is acccepted to CVPR 2025.
[1/22/2025] Two papers are acccepted to ICLR 2025.
[9/25/2024] Two papers are acccepted to NeurIPS 2024.
[5/1/2024] Two papers are acccepted to ICML 2024.
[3/13/2024] One paper is accepted to NAACL 2024.
[12/8/2023] One (co)first-authored paper is accepted to AAAI 2024.
[9/21/2023] Two papers are accepted to NeurIPS 2023. One is spotlighted.
[7/19/2023] Defended my thesis “Detecting and Learning Out-of-Distribution Data in the Open world: Algorithm and Theory”. Finally Dr. Sun!
[4/24/2023] One first-authored conference paper is accepted to ICML 2023.
[2/27/2023] One first-authored conference paper is accepted to CVPR 2023.
[1/4/2023] One first-authored journal paper is accepted to TMLR.

Yiyou Sun

News