An open-source toolkit for large-scale genomic analysis
Apache Doris is an easy-to-use, high performance and unified analytics database.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
JavaScript 对象的差异比较与修补
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
Create full-fledged APIs for slowly moving datasets without writing a single line of code.
Official code repository for GATK versions 4 and up
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Unix, R and python tools for genomics and data science
A collaboratively written review paper on deep learning, genomics, and precision medicine