Projects

Automating Contact Tracing

Github

Utilized graph neural networks to build a model that would attempt to predict contact tracing patterns on a mobility dataset in Austin, TX. One iteration of this project attempted to predict contact between individuals on a static network based on a contact network generated over 5 days of mobility data. Another iteration of this project attempted to predict contact between individuals on a dynamic contact network that would change every day. For the static network, the model achieved an ROC-AUC score of 0.91, and for the dynamic network, the model achieved an ROC-AUC score of 0.79.

PythonGraph Neural NetworksGraph Convolutional NetworksDeep Graph Library (DGL)NetworkX

UT Course Search

GithubWebsite

Built a semantic search engine for UT courses using OpenAI embeddings API and Pinecone Vector Database. Additionally, used LlamaIndexing with ChatGPT API to provide users with a chat-like answer to their queries about courses. Used Next.js to build the frontend, Flask to build to backend, Pinecone for the vector database, and Docker/Google Cloud Platform to deploy the application.

PythonLarge Language ModelsEmbeddingsLlamaIndexingPinecone Vector DatabaseNext.jsDockerGoogle Cloud Platform

Research Paper Summarizer

GithubStreamlit Demo

Finetuned a base T5 model that would take technical abstracts from research papers and generate a readable summary of the paper. The model was finetuned using the HuggingFace transformers library and the Trainer API, and it achieved a ROUGE-1 score of 0.403590 and a ROUGE-2 score of 0.124948. A demo of this was also created using Streamlit.

PythonLarge Language ModelsStreamlitHuggingFacePytorch

Shot Predictor

Github

Built a machine learning pipeline that utilizes deep learning based pose estimation networks to predict the outcome of a free throw in basketball. The model achieved an accuracy of 0.726 and ROC-AUC score of 0.794.

PythonTensorflowScikit-learnPose Estimation

LoFi Music Generator

GithubMedium Article

Used deep learning techniques such as Recurrent Neural Networks (RNNs) and Variational Auto-Encoders (VAEs) to produce a generative model that outputted new LoFI hip-hop music in the form of a MIDI file.

PythonTensorflowDeep LearningGenerative Models

NBA Game Predictor

Github

A side project where I scraped historical NBA game data as well as advanced team statistics and trained a model to predict the result of games in the current season. I used pandas and numpy to clean the data, matplotlib and seaborn to visualize the data, and the Random Forest and Logistic Regression models in Scikit-learn to perform the classifications.

PythonPandasScikit-learnMachine Learning

UT Hardware as a Service

Github

Built a full-stack web application that acts as a HaaS application that allows users to check out hardware resources and manage them. Involved building APIs to perform CRUD operations and a frontend GUI.

PythonJavascriptReactFlaskMongoDB

eHills

Created a multithreaded auctioning service using Java that allowed users to bid on and buy items. Implemented using Java and JavaFX.

Java

Array Simulator

Github

Helped build a simulator in python that models the behavior of solar cells given various external conditions. Implemented algorithms in this simulator to find the maximum power point, and measured the performance of of these algorithms using the simulator.

PythonJSONPyQTGraph

Spotify Rewind

WebsiteGithub

Utilized the spotify API to build a web app in python that allows users to view their most listened to songs and artists over certain periods of time. Additionally, it shows a weighted popularity score for the songs and artists the user listens to. Built using Flask.

PythonFlaskSpotiPy