About
Highly accomplished Senior Research Software Engineer and Engineering Lead with extensive experience in bioinformatics, specializing in full-stack development of high-performance web applications for genomic and multi-modal single-cell datasets. Proven leader in designing scalable systems, integrating AI/ML models, and delivering intuitive scientific user experiences to computational biologists and analysts. Expert in Python, JavaScript, and R, with a strong track record of developing open-source tools and driving data-driven insights for complex scientific challenges.
Work
Genentech Research and Early Development (gRED), A Member of Roche Group
|Sr. Research Software Engineer, Engineering Lead
South San Francisco, CA, US
→
Summary
Led the full-stack engineering and development of high-performance web applications for bioinformatics, integrating AI/ML models and harmonizing diverse datasets to deliver scalable scientific user experiences.
Highlights
Led engineering initiatives to develop scalable and interactive applications for indexing, searching, and harmonizing public and internal collections of multi-modal single-cell datasets.
Spearheaded the integration of interactive analytical workflows and AI/ML models across extensive multi-modal dataset collections, enhancing data utility for biologists.
Developed and launched open-source interactive exploratory tools, including Kana (https://github.com/kanaverse), for browser-based analysis and visualization of multi-modal single-cell datasets utilizing WebAssembly.
Directed Python development efforts at gRED to enable Bioconductor workflows and establish language-agnostic data storage systems, with significant contributions open-sourced in BiocPy (https://github.com/biocpy).
University of Maryland
|Faculty Specialist
College Park, MD, US
→
Summary
Advanced bioinformatics tools and methodologies as a Research Software Engineer at the Center for Bioinformatics and Computational Biology, leading software engineering initiatives for omic dataset visualization.
Highlights
Led software engineering initiatives for the creation of reusable web components and systems, enabling interactive visual analysis of genomic and genetic datasets.
Developed and maintained Epiviz (http://www.epiviz.org) for interactive visualization of genetic and epigenetic datasets, enhancing research capabilities.
Created the Metaviz (http://www.metaviz.org) suite of tools specifically designed for metagenomic analyses, expanding data interpretation capabilities.
National Center for Computational Toxicology (NCCT), U.S. Environmental Protection Agency
|Oakridge Science Research Fellow
Durham, NC, US
→
Summary
Developed interactive dashboard systems and data mining processes to visualize and update High Throughput Screening (HTS) data for the ToxCast program.
Highlights
Developed interactive dashboard systems for exploring and visualizing High Throughput Screening (HTS) data from the ToxCast program, improving data accessibility.
Engineered robust data mining processes to continuously update toxicity and compound information from thousands of sources within the ACTOR database, ensuring data currency and reliability.
National Center for Environmental Assessment (NCEA), U.S. Environmental Protection Agency
|Student Research Trainee
Arlington, VA, US
→
Summary
Collaborated with the US Census & sustainable communities to develop a platform for sharing environmental data and facilitating sustainable decision-making.
Highlights
Identified key use cases and co-created an ideation platform to engage stakeholders and communities in making sustainable environmental decisions.
Developed interactive tools to visualize shared environmental data, enhancing public access and understanding of critical information.
Education
North Carolina State University
→
Masters
Computer Science
V.R. Siddhartha Engineering College, Affiliated to Nagarjuna University
→
Bachelors
Computer Science and Engineering
Publications
Skills
Programming Languages
JavaScript, Python, R, Golang.
Data Management
Neo4j, MySQL, DuckDB.
Web Frameworks & Technologies
React, d3Js, Lit (Web components), WebGL, WebAssembly.
DevOps & Cloud
Terraform, AWS.
Bioinformatics & Genomics
Single-Cell RNA-seq analysis, Genomic Data Visualization, Epigenetic Data, Metagenomic Analysis, Bioconductor Workflows, Omic Datasets, High Throughput Screening (HTS).
Software Engineering
Full Stack Development, Scalable Applications, Backend Systems, Web Components, Open Source Development, AI/ML Integration, Data Mining, System Design.