zuston's repos on GitHub
CSS · 11 watchers
lizi
The tool is to fetch github discussion as blog post and deploy to vercel or github page
Rust · 6 watchers
legacy-riffle
Rust based Apache Uniffle shuffle server (riffle)
Rust · 3 watchers
curvine
High performance distributed cache system. Built by Rust.
Go · 2 watchers
AtcalMq
the message queue for ane trace big data, which serves for the maching learning prediction
Java · 1 watchers
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
0 watchers
advisor
Open-source implementation of Google Vizier for hyper parameters tuning
Java · 0 watchers
alluxio
Alluxio, formerly Tachyon, Unify Data at Memory Speed
Jupyter Notebook · 0 watchers
analytics-zoo
Distributed Tensorflow, Keras and BigDL on Apache Spark
0 watchers
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
JavaScript · 0 watchers
AtCal
Ane Trace Calculate (AtCal), Jobs for large data analysis
Java · 0 watchers
BeanMapper
choose bean mapping framework
Rust · 0 watchers
blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
0 watchers
butterfree
A tool for building feature stores.
0 watchers
byteps
A high performance and generic framework for distributed DNN training
Java · 0 watchers
ByteTCC
ByteTCC Transaction Manager旨在提供一个兼容JTA的基于TCC机制的分布式事务管理器。
0 watchers
caelus
Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs
0 watchers
candle
Minimalist ML framework for Rust
0 watchers
ceresdb
CeresDB is a high-performance, distributed, cloud native time-series database.
0 watchers
CLIC
旨在提供一个跨平台计算框架来统一异构软件系统
0 watchers
CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
0 watchers
comment
Only for blog comment
0 watchers
custom-op
Guide for building custom op for TensorFlow
0 watchers
datafusion
Apache DataFusion SQL Query Engine
Rust · 0 watchers
datafusion-distributed
Repo for donation of distributed DataFusion prototype - repo name will change
0 watchers
datafusion-postgres
Serving any JSON/CSN/Parquet/Arrow files like Postgres tables with Datafusion
0 watchers
DeepLearning
深度学习入门教程&&优秀文章&&Deep Learning Tutorial
Scala · 0 watchers
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
0 watchers
direct-spark-sql
a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.
Python · 0 watchers
estimator
TensorFlow Estimator
0 watchers
fedb
FEDB is a NewSQL optimised for Realtime Inference and Decisioning applications
Java · 0 watchers
Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
Java · 0 watchers
flink
Apache Flink
0 watchers
flink-native-k8s-operator
Flink native Kubernetes Operator is a java based control plane for running Apache Flink native application on Kubernetes.
0 watchers
flink-recommandSystem-demo
:helicopter::rocket:基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
Java · 0 watchers
flinkx
基于flink的分布式同步工具
0 watchers
fluss
Apache Fluss is a streaming storage built for real-time analytics.
0 watchers
fluss-rust
Rust Client for Apache Fluss (Incubating)
0 watchers
fory
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
0 watchers
genie
Distributed Big Data Orchestration Service
Go · 0 watchers
go-jd
京东自动登录,在线商品自动下单
Go · 0 watchers
GoLab
go experiment
Java · 0 watchers
griffin
Mirror of Apache griffin
0 watchers
GrokkingStreamingSystems
The source code for this book: Grokking Streaming Systems: Real-time Event Processing (https://www.manning.com/books/grokking-streaming-systems).
Go · 0 watchers
guery
Distributed SQL query engine written in Go for big data
Java · 0 watchers
hadoop
Mirror of Apache Hadoop
0 watchers
hazelcast
Open-source distributed computation and storage platform
Rust · 0 watchers
hdrs
HDFS Native Client in Rust via HDFS C API libhdfs
Python · 0 watchers
hearthbreaker
A Hearthstone: Heroes of WarCraft Simulator for the purposes of Machine Learning and Data Mining
Java · 0 watchers
hive
Apache Hive
0 watchers
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
0 watchers
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
0 watchers
incubator-hudi
Upserts, Deletes And Incremental Processing on Big Data.
Java · 0 watchers
incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
0 watchers
InjectGUI
macOS Integrated Injection Framework (GUI version)
C++ · 0 watchers
io
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
JavaScript · 0 watchers
IQL
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
0 watchers
iraft
another raft protocol implementation by go programming language, just for learning raft
0 watchers
jmx_exporter
A process for exposing JMX Beans via HTTP for Prometheus consumption
Go · 0 watchers
juicefs
A distributed POSIX file system built on top of Redis and S3.
Java · 0 watchers
jvm-sandbox
Real - time non-invasive AOP framework container based on JVM
0 watchers
katib
Repository for hyperparameter tuning
0 watchers
koordinator
QoS based scheduling system for hybrid orchestration workloads on Kubernetes, bringing workloads the best layout and status.
0 watchers
kruise
Automated management of large-scale applications on Kubernetes (project under CNCF)
0 watchers
kube2hadoop
Secure HDFS Access from Kubernetes
0 watchers
KungFu
KungFu: Making Training in Distributed Machine Learning Adaptive
Python · 0 watchers
lazybones
slack控制的自动化服务,为我服务
PHP · 0 watchers
lazyphp-webDemo
the school work is based on the framework of lazyphp3
Java · 0 watchers
Leetcode
leetcode practice
Java · 0 watchers
LinkedMatrix
Hadoop Task about LinkedMatrix from neo4j
Python · 0 watchers
LMCache
Redis for LLMs
0 watchers
logforth
A versatile and extensible logging implementation.