CV | Yu S. Huang

Full Name	Yu S. Huang/黄宇
Languages	Chinese, English

2022 -

臻和 Genecast Biotechnology Corp Ltd, Shanghai/Beijing/Wuxi, China

Modelling for the Genecast Multi-Cancer Early Detection product (THEMIS), https://www.genecast.com.cn/solutions/detail?id=29 (Bie et al., Nature Communications 2023).
Modelling for fixed-panel and WES custom-panel MRD (Minimal Residual Disease) products.
Optimizing core bioinformatics algorithms.
Taught Bayesian Statistics, Machine/Deep Learning, Julia/Rust Programming.

2015 - 2021

Shanghai Institute of Materia Medica, Chinese Academy of Sciences (中科院)

Developed AI models, algorithms, distributed computing platforms, and databases for personalized medicine (target discovery, validation, biomarkers), and AI models for drug design and virtual screening.
Developed Accucopy and Accurity in C++.
Taught Chris Bishop's 2006 book "Pattern Recognition and Machine Learning".
Taught Julia programming language, Matrix Computing, Optimization.

2014 - 2015

Illumina Inc., San Diego, California, USA

Developed the MethylSeq app in BaseSpace (C#).
Developed a Directed-Acyclic-Graph workflow system in C# that sped up Illumina bioinformatics workflows by >50X.
Developed bioinformatics libraries in Go that sped up analyses by >100X.
Competitive analyses in forensics, cancer, whole-genome, and exome sequencing.

2010

University of Southern California, Los Angeles, USA

2003

Fudan University, Shanghai, China

2017-now
Accucopy
- A computational method that infers allele-specific copy number alterations from low-coverage, low-purity tumor sequencing data.
2021-now
eGADA
- enhanced GADA: a fast segmentation algorithm based on Sparse Bayesian Learning (also known as the Relevance Vector Machine). It can be applied to array intensity data, NGS sequencing data, or any sequential data that behaves like a stepwise function. Enhancements include: 1) a customized Red-Black tree that significantly speeds up the final backward elimination step; 2) an implementation in C++, which is better structured than C; 3) a Python API, exported as eGADA.so.

Modelling & Algorithm
- Statistical Learning, Machine/Deep Learning, Optimization
Programming
- Rust, Python, C++, Julia, Go, R, Java, SQL, shell, awk
Library
- Parallel computing (Open MPI, MPICH), Boost C++ Library, Pegasus workflow system
SysAdmin
- PostgreSQL, MySQL, Lustre FS, ZFS, NFS, Ceph, LDAP, Kubernetes (K8s), Kubeflow, iptables, NGINX