Projects

Automating Contact Tracing

Python

Graph Neural Networks

Graph Convolutional Networks

Deep Graph Library (DGL)

NetworkX

Utilized graph neural networks to build a model that would attempt to predict contact tracing patterns on a mobility dataset in Austin, TX. One iteration of this project attempted to predict contact between individuals on a static network based on a contact network generated over 5 days of mobility data. Another iteration of this project attempted to predict contact between individuals on a dynamic contact network that would change every day. For the static network, the model achieved an ROC-AUC score of 0.91, and for the dynamic network, the model achieved an ROC-AUC score of 0.79.

UT Course Search

Python

Large Language Models

Embeddings

LlamaIndexing

Pinecone Vector Database

Next.js

Docker

Google Cloud Platform

Built a semantic search engine for UT courses using OpenAI embeddings API and Pinecone Vector Database. Additionally, used LlamaIndexing with ChatGPT API to provide users with a chat-like answer to their queries about courses. Used Next.js to build the frontend, Flask to build to backend, Pinecone for the vector database, and Docker/Google Cloud Platform to deploy the application.

Research Paper Summarizer

Python

Large Language Models

Streamlit

HuggingFace

Pytorch

Finetuned a base T5 model that would take technical abstracts from research papers and generate a readable summary of the paper. The model was finetuned using the HuggingFace transformers library and the Trainer API, and it achieved a ROUGE-1 score of 0.403590 and a ROUGE-2 score of 0.124948. A demo of this was also created using Streamlit.

Shot Predictor

Python

Tensorflow

Scikit-learn

Pose Estimation

Built a machine learning pipeline that utilizes deep learning based pose estimation networks to predict the outcome of a free throw in basketball. The model achieved an accuracy of 0.726 and ROC-AUC score of 0.794.

LoFi Music Generator

Python

Tensorflow

Deep Learning

Generative Models

Used deep learning techniques such as Recurrent Neural Networks (RNNs) and Variational Auto-Encoders (VAEs) to produce a generative model that outputted new LoFI hip-hop music in the form of a MIDI file.

NBA Game Predictor

Python

Pandas

Scikit-learn

Machine Learning

A side project where I scraped historical NBA game data as well as advanced team statistics and trained a model to predict the result of games in the current season. I used pandas and numpy to clean the data, matplotlib and seaborn to visualize the data, and the Random Forest and Logistic Regression models in Scikit-learn to perform the classifications.

UT Hardware as a Service

Python

Javascript

React

Flask

MongoDB

Built a full-stack web application that acts as a HaaS application that allows users to check out hardware resources and manage them. Involved building APIs to perform CRUD operations and a frontend GUI.

eHills

Java

Created a multithreaded auctioning service using Java that allowed users to bid on and buy items. Implemented using Java and JavaFX.

Array Simulator

Python

JSON

PyQTGraph

Helped build a simulator in python that models the behavior of solar cells given various external conditions. Implemented algorithms in this simulator to find the maximum power point, and measured the performance of of these algorithms using the simulator.

Spotify Rewind

Python

Flask

SpotiPy

Utilized the spotify API to build a web app in python that allows users to view their most listened to songs and artists over certain periods of time. Additionally, it shows a weighted popularity score for the songs and artists the user listens to. Built using Flask.