PhD student at U&I Lab@KAIST
Pinned
- atomic-persona-evaluation (Public)
Repository for "Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation" (Findings of ACL 2025)
- Korean-Abusive-Language-Dataset (Public)
Translated abusive language dataset (En2Ko), including OffensEval/AbusEval, CADD, Davidson et al., and Waseem & Hovy.
Python
- RoleConflictBench (Public)
Repository for "RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity" (Findings of ACL 2026)
Python 3