Thank you very much for checking out my GitHub page!
I'm a Bioinformatician and full-stack web developer, experienced in developing and automating next-generation sequencing (NGS) data analysis pipelines. My experience spans various aspects of bioinformatics and web development, including the analysis of clinical whole-exome, whole-genome, and RNA-seq samples.
In my previous roles, I've developed databases containing variant annotations and their allele frequencies across different populations using in-house cohorts. I've also created a real-time dashboard for users to monitor the status of the NGS variant calling pipeline and access QC metrics and other results. Additionally, I've contributed to the development of a novel machine-learning approach that predicts the immunogenicity of a peptide using various features derived from amino-acid composition, HLA-binding, structural features, peptide processing, and peptide transport. My work experience also includes developing scripts and tools for interacting with public databases.
-
Developed and published various workflows using the Workflow Description Language (WDL):
-
SRA annotator: A command-line tool for retrieving annotations from the NCBI SRA database. (Python)
-
ARA (Automated Record Analysis): An automatic pipeline for exploration of SRA datasets with sequences as a query (Perl, Docker)
-
Covid19 tracker (Python Django, HighchartsJS):
-
Word usage using dictionary API (ReactJS):
-
Netflix-clone using themoviedb API (ReactJS):
- WGS, WES and RNA-seq data analysis
- Somatic and germline variant calling and annotation
- Data analysis pipeline development and automation