About Me

Hi, I’m Jaylen Jones, a third-year PhD student in Computer Science & Engineering at The Ohio State University (OSU) advised by Prof. Huan Sun and Prof. Eric Fosler-Lussier. Prior to joining OSU, I received my B.S. in Computer Science from the College of Informatics at Northern Kentucky University (NKU).

🚀 Seeking a research internship for Summer 2026, feel free to reach out if you have any opportunities!

Research Interests

My research centers on AI with a focus in natural language processing, the alignment of large language models (LLMs) to human values, and LLM-based agents. I am particularly interested in the design, evaluation, and applications of LLM-based systems for high-stakes, human-centered applications, unlocking the full potential of AI capabilities while ensuring the trustworthiness, robustness, and safety required for real-world use.

My current work focuses on the following high-level area:

Evaluating and mitigating the risks of computer-use agents, with an emphasis on both security (i.e., protecting agents from adversarial attack) and safety (i.e., preventing accidental agent harms emerging from typical benign inputs).

Publications & Papers

RedTeamCUA: Towards Realistic Adversarial Testing of CUAs in Hybrid Web-OS Environments (Oral)
Paper · Website · Code
Zeyi Liao*, Jaylen Jones*, Linxi Jiang*, Eric Fosler-Lussier, Yu Su, Zhiqiang Lin, Huan Sun
(* denotes equal contribution)
The Fourteenth International Conference on Learning Representations
(ICLR 2026)
When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents
Paper · Website · Code · Data
Jaylen Jones*, Zhehao Zhang*, Yuting Ning, Eric Fosler-Lussier, Pierre-Luc St-Charles, Yoshua Bengio, Dawn Song, Yu Su, Huan Sun
(* denotes equal contribution)
(arXiv 2026)
When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents
Paper · Website · Code · Data
Yuting Ning, Jaylen Jones, Zhehao Zhang, Chentao Ye, Weitong Ruan, Junyi Li, Rahul Gupta, Huan Sun
(arXiv 2026)
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts
Paper
Vishal Kumar, Zeyi Liao, Jaylen Jones, Huan Sun
(arXiv 2025)
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
Paper
Jaylen Jones, Lingbo Mo, Eric Fosler-Lussier, Huan Sun
2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics
(NAACL 2024)

Awards and Experience

Attended 2025 Conference on Language Modeling to present at COLM 2025 Workshop on AI Agents: Capabilities and Safety, 2025
Lead Student Writer on accepted Open Philanthropy - Call for AI Safety Research grant, 2025
Led as a Student Presenter at the Center for AI Policy’s Congressional Exhibition for Advanced AI, 2025
Lead Student Writer on accepted Schmidt Sciences’ Safety Science Initiative grant, 2024
Attended the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 2024
Inaugural member of the L.I.F.E Foundation Fellowship at NKU, 2018