Ph.D. Candidate in Computer Science
Carnegie Mellon University
saranyav [at] andrew.cmu.edu
I work on making AI systems more secure, private, and understandable. My research combines formal verification and machine learning to address vulnerabilities in areas like fraud detection, secure code generation, and privacy-preserving protocols. I’m especially focused on identifying and exploiting weaknesses through red teaming and jailbreaks, building tools that help us understand why these systems break, and developing methods to make them safer.
I am fortunate to be advised by Christos Faloutsos and Matt Fredrikson. Previously, I completed my undergraduate degree at Harvard with a joint concentration in computer science and government, writing my thesis with Cynthia Dwork and Jim Waldo. After Harvard, I spent three years as an associate at Goldman Sachs before beginning my PhD.
My AI governance experience includes running National Security Policy at Harvard's Institute of Politics, graduate coursework at the Kennedy School, an internship at Booz Allen Hamilton, and research collaboration with Bruce Schneier at the Berkman Klein Center. At CMU, I served as a teaching assistant for Norman Sadeh's Security, Privacy and Public Policy course and I guest lecture for his AI governance class. I am currently supported by the Department of Defense National Defense Science and Engineering Graduate Fellowship through the Army Research Office.
Core research areas:
Nils Palumbo, Ravi Mangal, Zifan Wang, Saranya Vijayakumar, Corina Pasareanu, Somesh Jha
In this work we present a formal framework for mechanistically interpreting neural networks, motivated by abstract interpretation from program analysis. We apply this framework to analyze a Transformer model trained to solve the 2-SAT problem. Through our analysis, we uncover that the model learns a systematic algorithm: first parsing input formulas into clause-level representations in the initial layers, then evaluating satisfiability by enumerating possible Boolean variable valuations. Our work provides evidence that the extracted mechanistic interpretation satisfies our proposed formal axioms, demonstrating how the model systematically solves 2-SAT problems.
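The enumeration strategy described above can be sketched as a brute-force 2-SAT check. This is an illustrative Python sketch of the underlying algorithm, not the model's internal mechanism; the function name and literal encoding are assumptions for exposition:

```python
from itertools import product

def two_sat_by_enumeration(clauses, n_vars):
    """Check satisfiability of a 2-SAT formula by enumerating valuations.

    `clauses` is a list of literal pairs; a literal is a nonzero int,
    where +i means variable i is true and -i means variable i is negated.
    """
    for valuation in product([False, True], repeat=n_vars):
        def holds(lit):
            value = valuation[abs(lit) - 1]
            return value if lit > 0 else not value
        # A valuation satisfies the formula if every clause has a true literal.
        if all(holds(a) or holds(b) for a, b in clauses):
            return True
    return False

# (x1 OR x2) AND (NOT x1 OR x2) is satisfiable (take x2 = True)
print(two_sat_by_enumeration([(1, 2), (-1, 2)], 2))   # True
# (x1 OR x1) AND (NOT x1 OR NOT x1) is unsatisfiable
print(two_sat_by_enumeration([(1, 1), (-1, -1)], 1))  # False
```

Enumeration is exponential in the number of variables, which is what makes it plausible for a fixed-size Transformer operating over short formulas but not a general-purpose 2-SAT solver.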
Priyanshu Kumar, Saranya Vijayakumar, Elaine Lau, Tu Trinh, Zifan Wang, Matt Fredrikson
Our work investigates whether LLMs' safety training, which makes them refuse harmful instructions in chat contexts, extends to non-chat agent scenarios, particularly browser agents. We show that while LLMs may refuse harmful requests in chat form, the same models, when used as browser agents, often fail to maintain these refusals. Through extensive testing with BrowserART (Browser Agent Red teaming Toolkit) on 100 browser-based harmful behaviors, we demonstrate that browser agents pursue many harmful behaviors that would be refused in chat contexts. We find that GPT-4o- and o1-preview-based browser agents pursued 98 and 63 harmful behaviors respectively (out of 100) under certain attack conditions, indicating that chat-based safety training does not sufficiently transfer to agent contexts.
Zifan Wang*, Saranya Vijayakumar*, Kaiji Lu, Vijay Ganesh, Somesh Jha, Matt Fredrikson
A novel framework combining neural networks with SMT solvers to enhance reasoning capabilities while maintaining computational efficiency, achieving a 15% accuracy improvement on complex reasoning tasks.
Mirela Cazzolato, Saranya Vijayakumar, Meng-Chieh Lee, Namyong Park, Catalina Vajiac, Christos Faloutsos
A system for detecting and visualizing fraud patterns in large-scale telecommunication networks, combining advanced graph mining with interactive visualizations. Successfully deployed with a Portuguese telecom provider.
Saranya Vijayakumar*, Matt Fredrikson, Christos Faloutsos
This paper introduces MalCentroid, a novel framework for tracking malware evolution through behavioral analysis. While existing malware detection approaches achieve high accuracy on known samples, they typically treat each sample in isolation and rely on surface features that are easily manipulated. MalCentroid addresses both limitations by decomposing malware samples into behavioral primitives extracted from control flow graphs and tracking their evolution through a centroid-based embedding space. The framework maintains multiple behavioral prototypes per malware family, enabling it to track behavioral drift over time and identify truly novel variants. Evaluation on large-scale datasets demonstrates two key advantages: the ability to track malware evolution patterns (showing families exhibiting drift rates up to 0.633 and uncovering parallel evolution across distinct families), and inherent robustness against adversarial manipulation (with most attack vectors achieving success rates below 5%, versus 97% against image-based approaches). These results suggest that focusing on behavioral primitives rather than surface features provides both better evolution tracking and natural security benefits.
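The centroid idea above can be illustrated with a minimal sketch: embed each sample's behavioral primitives as a vector, summarize a family by the centroid of its samples, and measure drift as the distance between centroids of earlier and later batches. All names and the cosine-distance drift measure here are assumptions for exposition, not MalCentroid's actual implementation:

```python
import numpy as np

def family_centroid(embeddings):
    """Centroid of a family's behavioral-primitive embedding vectors."""
    return np.mean(embeddings, axis=0)

def drift_score(old_samples, new_samples):
    """Illustrative drift measure: cosine distance between the centroids
    of an earlier and a later batch of samples from one malware family."""
    c_old = family_centroid(old_samples)
    c_new = family_centroid(new_samples)
    cos = np.dot(c_old, c_new) / (np.linalg.norm(c_old) * np.linalg.norm(c_new))
    return 1.0 - cos

# Synthetic example: a later batch whose behavior has shifted.
rng = np.random.default_rng(0)
old = rng.normal(loc=0.0, size=(50, 8))
new = rng.normal(loc=0.5, size=(50, 8))  # shifted behavioral distribution
print(round(abs(drift_score(old, old)), 3))          # ~0 for identical batches
print(drift_score(old, new) > drift_score(old, old))  # True: shifted batch drifts more
```

Comparing distributions of behavioral primitives, rather than surface features, is what gives this style of tracking its robustness: an attacker must change what the program does, not just how it looks.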
Saranya Vijayakumar, Christos Faloutsos, Matt Fredrikson
A comprehensive evaluation of traditional, transformer-based, and visual-textual approaches for detecting AI-generated code across multiple programming languages.
Saranya Vijayakumar, Matt Fredrikson, Norman Sadeh
AI Governance Course (17-416/17-716), March 31, 2025
Information Security, Privacy & Policy (17-331/631), November 21, 2024
AI Governance Course (17-416/17-716), April 3, 2024
Information Security, Privacy & Policy (17-331/631), December 5, 2023
Information Security, Privacy & Policy (17-331/631), Fall 2024
I believe in creating an inclusive learning environment that emphasizes practical understanding and critical thinking. My teaching approach combines theoretical foundations with hands-on experience, preparing students for both academic and industry challenges.
Teaching Assistant
Instructors: Norman Sadeh and Hana Habib
Course Highlights:
Teaching Assistant
Instructor: Dave Touretzky
Course Highlights:
Participant in Carnegie Mellon's teaching development program
Mentored Philip Negrin on AI Code Detection research (project video) and a second student on their research project.
Guided a team of 4 students on privacy research analyzing Google's Topics API (USENIX PEPR '24).
TgrApp system visualization interface
Developed novel visualization and detection methods for analyzing million-scale fraud patterns in telecommunication networks, leading to deployed solutions with real-world impact.
Formal verification of the Olvid messaging protocol using ProVerif
Early work on fairness in algorithmic decision-making systems, combining technical analysis with policy implications.
Featured Article in Harvard Political Review
Undergraduate Thesis on Fairness Metrics in ML
Investigating privacy vulnerabilities in Google's Topics API through novel LLM-based approaches
Under Review, 2025
Novel techniques for evaluating and enhancing privacy protections in modern web APIs
Implemented healthcare technology solutions with Partners in Health, Lima, Peru
Led computer science education programs in Boston public schools
NSA (Declined)
Army Research Office
Carnegie Mellon University Eberly Center
Goldman Sachs - New York, NY
Beto O'Rourke Senate Campaign