Projects
A collection of projects I've worked on. Click to see details.
MOSAIC: Exploiting Compositional Blindness in Multimodal Alignment
A novel multimodal jailbreak framework that targets compositional blindness in Vision-Language Models. MOSAIC rewrites harmful requests into Action–Object–State triplets, renders them as stylized visual proxies, and induces state-transition reasoning to bypass safety guardrails. Published at NTU ML 2025 Fall Mini-Conference (Oral).
PEFT-STVG: Parameter-Efficient Fine-Tuning for Spatio-Temporal Video Grounding
Spatio-Temporal Video Grounding (STVG) localizes objects in video frames that match natural language queries across time. While effective, existing methods require full model fine-tuning, creating significant computational bottlenecks that limit scalability and accessibility.
CPF-Net: Continuous Perturbation Fusion Network for Weather-Robust LiDAR Segmentation
A continuous perturbation fusion network for weather-robust LiDAR segmentation, achieving SOTA performance on the SemanticKITTI -> SemanticSTF dataset.
Fine-tune LLAVA on Autonomous Driving (ECCV 2024 Challenge)
PREVISION: PRe-training Enhanced Versatile Integration of Semantics, Images, and Object Detection for Novel Corner Case Analysis in Autonomous Driving, NTU DLCV Fall 2024 Final Project | ECCV 2024 Autonomous Driving Challenge
Preference-Guided Meta-RL
Preference-Guided Meta-RL is a framework for learning policies that maximize user preferences using reinforcement learning.
JingleFace
An AI-powered Christmas campaign app that achieved 10K+ user interactions, delivering real-time Pixar-style image transformations through a fine-tuned diffusion model.
PicCollage.com
Developed and maintained the official PicCollage website, improving page load speed and supporting over 100K+ monthly active users.
PicCollage Company Website
Developed and maintained the official PicCollage Company Website, improving user experience and supporting over 60K+ monthly active users.