PredictionIO, a machine learning server for developers and ML engineers.
The official home of the Presto distributed SQL query engine for big data
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
CMAK is a tool for managing Apache Kafka clusters
ClickHouse is a free analytics DBMS for big data
Apache Spark - A unified analytics engine for large-scale data processing
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Apache Flink
An open source cybersecurity protocol for syncing decentralized graph data.