Who I am
PhD student in Informatics
At Pennsylvania State University, I work on privacy, web-scale text analysis, and responsible AI questions.
PhD Student in Informatics - Pennsylvania State University
PhD student in Informatics at Penn State studying privacy policy analysis, sectoral web classification, and disclosure behavior in AI agents.

Overview
I study privacy disclosures in two settings: the conventional web and emerging agentic systems. The site centers on the datasets, analyses, and evaluation work that connect those settings.
Who I am
At Pennsylvania State University, I work on privacy, web-scale text analysis, and responsible AI questions.
What I study
My work tracks privacy disclosures in websites and asks how those norms shift when AI agents act on a user’s behalf.
Where to start
These projects show the main arc of the site: sector discovery, privacy analysis, and agentic AI evaluation.
Research Map
Three themes organize the current work.
I build datasets and models that recover service sector from website text, making sector context usable for downstream analysis.
I study how disclosures vary across industries and over time, with attention to transparency, vagueness, and sector-specific norms.
I examine what AI agents reveal during tool use and how model behavior and schema design shape oversharing risk.
Selected Work
A few projects that show the main arc of the research.
Problem
Sector context is hard to recover from noisy and heterogeneous website content.
Contribution
SoACer and the SoAC Corpus provide a web-scale classification pipeline and dataset for sector-aware analysis.
Why it matters
They make privacy and governance studies more useful by preserving service context instead of flattening the web.
Problem
Privacy disclosures are difficult to compare across industries and time at scale.
Contribution
My work uses PrivaSeer and sector-aware analyses to study convergence, divergence, and opacity in privacy policies.
Why it matters
This creates empirical grounding for transparency, governance, and data-handling research.
Problem
AI agents may disclose user information too similarly across service contexts when they use tools.
Contribution
My dissertation work treats runtime tool calls as a key disclosure moment and studies both model behavior and schema design.
Why it matters
It extends privacy analysis from the conventional web to agentic systems.
Selected Publications
Selected papers relevant to the homepage. Full publication details are on Google Scholar and the CV.
Builds a dataset and classification pipeline for sector-aware web analysis.
Provides infrastructure for large-scale transparency and privacy-policy analysis.
Earlier work on robust NLP under low-label conditions.
Academic Snapshot
Work on PrivaSeer, sector-aware privacy analysis, and large-scale empirical studies of disclosure behavior.
Dissertation on the evolution of sectoral privacy norms from the conventional web to the agentic web.
Applied NLP methods to qualitative corpora and translated findings into policy-relevant insights for public stakeholders.
Studied fairness and robustness in semi-supervised text classification and built the work that became my master's thesis.
Reviewed work on trustworthy NLP and responsible model behavior.
Spoke with students about research careers and graduate study.
Collaboration and Contact
I welcome conversations about privacy, AI governance, responsible NLP, and agentic systems, especially around research collaboration, datasets and evaluations, contributor work, and research opportunities.