Research Scientist Email: jianfengchi(AT)meta(DOT)com [Google Scholar] [Twitter] [LinkedIn] [Meta Homepage] [Github] |
I am a research scientist at GenAI Meta, working on LLM safety problems. Previously, I received my Ph.D. in Computer Science from the University of Virginia in 2022. During my Ph.D. study, I interned at Facebook and Amazon. My research interests broadly lie in machine learning and natural language processing, with a specific focus on trustworthy AI.
* indicates equal contributions, listed in alphabetical order.
♦ indicates students or interns I mentor.
For a full list of my publications/manuscripts, please go to my Google Scholar webpage.
Llama-Guard: LLM-based Input-Output Safeguard for Human-AI Conversations (Featured work by AI at Meta)
Hakan Inan, Kartikeya Upasani, Jianfeng Chi, Rashi Rungta, Krithika Iyer, Yuning Mao, Michael Tontchev, Qing Hu, Brian Fuller, Davide Testuggine, Madian Khabsa.
[pdf]
[blog post]
[Huggingface checkpoint][code]
Where have you been? A Study of Privacy Risk for Point-of-Interest Recommendation
Kunlin Cai♦, Jinghuai Zhang♦, Will Shand, Zhiqing Hong, Guang Wang, Desheng Zhang, Jianfeng Chi, Yuan Tian.
[pdf]
FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods (ICLR 2024)
Xiaotian Han♦, Jianfeng Chi, Yu Chen, Qifan Wang, Han Zhao, Na Zou, Xia Hu.
[pdf][code]
PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English (ACL 2023)
Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian, Kai-Wei Chang.
[pdf][code]
Retrieval Enhanced Data Augmentation for Question Answering on Privacy Policies (EACL 2023)
Md Rizwan Parvez, Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian, Kai-Wei Chang.
[pdf]
Conditional Supervised Contrastive Learning for Fair Text Classification (EMNLP Findings 2022)
Jianfeng Chi, William Shand, Yaodong Yu, Kai-Wei Chang, Han Zhao, Yuan Tian.
[pdf][code]
Towards Return Parity in Markov Decision Processes (AISTATS 2022)
Jianfeng Chi, Jian Shen, Xinyi Dai, Weinan Zhang, Yuan Tian, Han Zhao.
[pdf]
[code]
Understanding and Mitigating Accuracy Disparity in Regression (ICML 2021)
Jianfeng Chi, Yuan Tian, Geoffrey J. Gordon, Han Zhao.
[pdf]
[code]
Intent Classification and Slot Filling for Privacy Policies (ACL 2021)
Wasi Uddin Ahmad*, Jianfeng Chi*, Tu Le, Thomas Norton, Yuan Tian, Kai-Wei Chang.
[pdf]
[code]
[Video]
Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation (NeurIPS 2020)
Jianfeng Chi*, Han Zhao*, Yuan Tian, Geoffrey J. Gordon.
[pdf]
[Poster]
[Slides]
PolicyQA: A Reading Comprehension Dataset for Privacy Policies (EMNLP Findings 2020)
Wasi Uddin Ahmad*, Jianfeng Chi*, Yuan Tian, Kai-Wei Chang.
[pdf]
[code]
Hybrid Batch Attacks: Finding Black-box Adversarial Examples with Limited Queries (USENIX Security 2020)
Fnu Suya, Jianfeng Chi, David Evans, Yuan Tian.
[pdf]
[code]
PC Member/Reviewer:
Conferences:
Journals: