Tanay Komarlu

MSCS Graduate from the University of Illinois Urbana-Champaign

Biography

Hello! I’m Tanay, a Master’s Graduate from the Data Mining Group @ UIUC (2021-2023) advised by Professor Jiawei Han. I enjoy creating software that lives on the internet, whether that be websites, applications, or anything in between. My work focuses on Machine Learning Engineering, Natural Language Understanding and Biomedical Text Mining.

Interests
  • Text Mining
  • Natural Language Processing
  • Large Language Models
Education
  • MS in Computer Science, 2023

    University of Illinois Urbana-Champaign

  • BS in Computer Science, 2021

    University of California Santa Barbara

Experience

Software Engineering Intern
Google
May 2022 – Aug 2022 California
  • Enabled a Web-based IDE integration test stack to handle non-deterministic user data in RPC calls
  • Engineered an automated integration test suite to preemptively identify regressions in Web-based IDE File Explorer User Interface
  • Designed and implemented a hermetic integration testing solution, using an application-level load balancer and API proxy SUTs, that lets Google extension developers test extensions that rely on gRPC-Web connections

Machine Learning Intern
Roche
Jun 2021 – Aug 2021 California
  • Developed a relationship extraction model using distant supervision to identify chemical and disease relationships in NCBI Open Access documents
  • Engineered a normalization pipeline for NCBI documents in Java and Python. Researched and implemented tokenization and approximate nearest neighbor algorithms

Undergraduate Research Assistant
University of California Santa Barbara
Jan 2021 – Jun 2021 California
  • Created an algorithm that processes subtitled videos and outputs a video containing only human dialogue. The algorithm performs speaker segmentation and uses Tesseract with OpenCV to return frame captures of the text mapped to timestamps

Undergraduate Learning Assistant
University of California Santa Barbara
Oct 2020 – Dec 2021 California
  • Groomed and managed three epics for React and Spring Boot web applications based on user testing and feedback
  • Mentored and performed code reviews for three teams of students in CMPSC 156, a web application development course teaching legacy code development using Java, JavaScript, Spring Boot, React, Git / GitHub, Agile workflows, and CI/CD testing

Machine Learning Intern
Nggwae Nirman Healthcare Technologies
Jun 2020 – Sep 2020 California
  • Developed a machine learning model with 98% precision to predict the NPS of future customers for a B2B logistics firm, improving productivity by 10%
  • Engineered a patient no-show predictive model with 95% precision, using the scikit-learn and pandas libraries in Python, for use in pediatric clinics

Publications

(2023). MEGClass: Text Classification with Extremely Weak Supervision via Mutually-Enhancing Text Granularities. In Progress.

(2021). Teaching Testing with Modern Technology Stacks in Undergraduate Software Engineering Courses. In ITiCSE.

Accomplishments

Natural Language Processing
See certificate
Machine Learning
See certificate