Research

Research Publications

Preprint

Document-as-Image Representations Fall Short for Scientific Retrieval

Ghazal Khalighinejad, Raghuveer Thirukovalluru, Alexander H. Oh, Bhuwan Dhingra

arXiv, 2026

ArXivDoc: an 8,210-document open-domain scientific retrieval benchmark showing text-based representations outperform document-as-image approaches. Contributed data annotation filtering that distilled 4,059 candidates to 547 evidence-grounded queries.

arXiv

Brax humanoid reaching goal with SIGReg-ISO regularization on actor trunk
Self-supervised Reinforcement Learning Representation Learning JAX Simulations

Scaling deep goal-conditioned RL with SIGReg-ISO

2026

Applies Sketched Isotropic Gaussian Regularization to the actor trunk of a deep goal-conditioned RL network, preventing representation collapse at scale and improving humanoid evaluation success in Brax.

Article · W&B Report · Code

Sentinel defense system dashboard showing claim analysis metrics, pipeline status, and ticker coverage
Sensitivity Analysis Time Series Analysis Web Scraping

Sentinel

2026

Defense Stock Claim Analyzer. Scrapes tweets about defense stocks, filters out bots, fetches the actual price movement and surrounding news, then labels each claim as exaggerated, accurate, or understated. Tracks account-level credibility to surface who's consistently right and who's full of it.

Website · Code

One Haut Encoded interface showing a style profile and curated favourite clothing grid
Neural Collaborative Filtering Recommendation Systems Real-Time Inference

One Haut Encoded

2025

Fashion recommendation stack on the H&M Personalized dataset: popularity baseline, KNN content-based filtering (product metadata and ResNet50 image embeddings), and neural collaborative filtering. Includes a static demo and FastAPI backend for live inference.

Website · Code

Drone input image (top) and LeJEPA depth prediction in inferno colormap (bottom)
LeJEPA Self-Supervised Learning Production API

LeDEEP: monocular depth estimation with LeJEPA

2026

ViT-Small encoder trained with LeJEPA multi-view self-supervised learning and SIGReg regularization on 1280×720 drone footage (DDOS dataset), served via a Flask + Gunicorn REST API with sub-second inference on arbitrary images.

Article · Code

World-model imagination rollout in the Crafter environment
Web App Reinforcement Learning & World Models

Crafter rollout collector

2026

Collect human and agent Crafter rollouts in the browser, demo trained policies with overlays, and compare world-model imagination to live frames (FastAPI + React; Railway + Vercel).

Code · Article

LLM interpretability AI Safety Steering

Steering Mechanisms to Mitigate Sycophancy

2026

Linear probes on MHA, MLP, and residual streams for sycophancy (TruthfulQA) and assertiveness (epistemic-integrity), with activation steering and R² head maps.

Code · Article

CLIP attack comparison: original vs. Base vs. Large attack on a Patriots Quarterback image
VLM Robustness Adversarial AI

Attack Embeddings

2024

Explores adversarial attacks and defenses around multimodal embeddings (e.g. OpenAI CLIP ViT-L/14). Replace with a precise one-sentence summary when ready.

Code

From the blog

Steering experiments to reduce sycophancy (linear probes on LLM activations)