I’m currently an undergrad at Stanford University. I'm broadly interested in NLP, robotics, and ML security. Currently, I’m working on reinforcement learning and LLMs at IRIS lab and ML systems at Foundry.
Some things I’ve worked on:
Better aligning LLMs with humans using different forms of preference data as part of IRIS lab
Building out and optimizing fine-tuning as a service at Foundry