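:dart: :star2: [Big Data Interview Questions] A collection of big data interview questions gathered from the web, together with my own summarized answers. Currently covers interview questions on the Hadoop/Hive/Spark/Flink/HBase/Kafka/ZooKeeper frameworks.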
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
Real-time, coding-free, full-featured, and secure ORM library. Provides APIs and docs with no backend coding, and frontend (client) users can customize the data and structure of the returned JSON.
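A beginner's guide to big data :star:
flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column 《大数据实时计算引擎 Flink 实战与性能优化》 (Flink in Practice and Performance Optimization: A Big Data Real-Time Computing Engine) is welcome.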
Apache Doris is an easy-to-use, high-performance, unified analytics database.
A curated list of awesome big data frameworks, resources and other awesomeness.
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.