About me

I am currently a Postdoctoral Researcher in Prof. Dawn Song’s group at UC Berkeley. Before that, I earned my Ph.D. in Computer Sciences from the University of Wisconsin-Madison, advised by Prof. Sharon (Yixuan) Li. My PhD research aims to pave the way to a reliable Open-world Machine Learning system, covering topics: Out-of-distribution (OOD) Detection, Open-world Representation Learning (ORL), Interpretability.

My current research revolves around the trustworthy Large Language Models (LLMs) on the following topics:

  • Hallucination, Mechanistic Interpretability, Safety

(Latest update: 09/26/2024)

News

  • [9/25/2024] Two papers are acccepted to NeurIPS 2024.
  • [5/1/2024] Two papers are acccepted to ICML 2024.
  • [3/13/2024] One paper is accepted to NAACL 2024.
  • [12/8/2023] One (co)first-authored paper is accepted to AAAI 2024.
  • [9/21/2023] Two papers are accepted to NeurIPS 2023. One is spotlighted.
  • [7/19/2023] Defended my thesis “Detecting and Learning Out-of-Distribution Data in the Open world: Algorithm and Theory”. Finally Dr. Sun!
  • [4/24/2023] One first-authored conference paper is accepted to ICML 2023.
  • [2/27/2023] One first-authored conference paper is accepted to CVPR 2023.
  • [1/4/2023] One first-authored journal paper is accepted to TMLR.