Open Source Projects

Xuxiaotuan.github.io

blog

1 0 2025-07-20

turbo-for90night

Inspired by https://github.com/MichaelCade/90DaysOfDevOps

1 1 2023-03-17

prism-zio

A rewrite of prism, a highly concurrent system, using ZIO

1 0 2025-07-09

pekko-reference

pekko-playground-reference

1 0 2025-06-07

calcite-demo

study calcite

1 0 2024-05-28

bolt

An embedded key/value database for Go.

1 0 2023-01-20

zookeeper

Apache ZooKeeper

0 0 2023-08-20

zio-spark

A functional wrapper around Spark to make it work with ZIO

0 0 2025-07-09

zio-reference

0 0 2023-07-15

zio-quickstarts

A minimal quickstart ZIO application for writing a RESTful Web Service

0 0 2023-11-11

zio-k8s

An idiomatic ZIO client for the Kubernetes API.

0 0 2025-07-09

zio-direct

Direct-Style Programming for ZIO

0 0 2023-02-22

zio-apache-parquet

Scala ZIO-powered Apache Parquet library

0 0 2024-09-25

zed

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

0 0 2024-03-07

yt-dlp

A youtube-dl fork with additional features and fixes

0 0 2024-01-24

xxt_test

test project

0 0 2025-06-26

xxt-paper-notes

paper notes

0 0 2024-07-06

xxt-game

javafx game

0 0 2024-02-02

xxt-file

0 0 2025-03-09

xxl-job

A distributed task scheduling framework. (Distributed task scheduling platform XXL-JOB)

0 0 2024-01-08

Xuxiaotuan

Config files for my GitHub profile.

0 0 2024-09-29

wmproxy

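An nginx-like server implemented in Rust, aiming to be a viable alternative: HTTP/HTTPS proxy, SOCKS5 proxy, load balancing, reverse proxy, static file server, layer-4 TCP/UDP forwarding, WebSocket forwarding, and NAT traversal for intranet access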

0 0 2024-05-07

webmagic

A scalable web crawler framework for Java.

0 0 2019-09-18

volcano

A Cloud Native Batch System (Project under CNCF)

0 0 2025-02-28

trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

0 0 2025-03-04

tiny-llm

(🚧 WIP) a course of serving LLM on Apple Silicon for systems engineers.

0 0 2025-04-29

tikv

Distributed transactional key-value database, originally created to complement TiDB

0 0 2024-03-07

tiflow

This repo maintains DM (a data migration platform) and TiCDC (change data capture for TiDB)

0 0 2023-04-21

tidb-test-container-example

An example for testcontainers-java and TiDB.

0 0 2022-10-21

tidb

TiDB is an open source distributed HTAP database compatible with the MySQL protocol

0 0 2023-08-08

TidalFlow

Data platform

0 0 2023-10-27

TiBigData

TiDB connectors for Flink/Hive/Presto

0 0 2022-09-19

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

0 0 2023-08-02

test_db

A sample MySQL database with an integrated test suite, used to test your applications and database servers

0 0 2023-12-13

testcontainers-java

Testcontainers is a Java library that supports JUnit tests, providing lightweight, throwaway instances of common databases, Selenium web browsers, or anything else that can run in a Docker container.

0 0 2022-10-21

technical-writing-template

A sample template with guidelines for writing technical articles.

0 0 2023-08-20

tailscale

The easiest, most secure way to use WireGuard and 2FA.

0 0 2024-01-14

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

0 0 2022-10-24

superset

Apache Superset is a Data Visualization and Data Exploration Platform

0 0 2023-12-04

streamx

Make Flink|Spark easier!!! The original intention of StreamX is to make the development of Flink easier. StreamX focuses on the management of development phases and tasks. Our ultimate goal is to build a one-stop big data solution integrating stream processing, batch processing, data warehouse and data lake.

0 0 2025-07-05

streampark-flink-kubernetes-v2

Design of refactored streampark flink-kubernetes module

0 0 2023-11-28

stitch

0 0 2024-11-21

steampipe

Zero-ETL, infinite possibilities. Live query APIs, code & more with SQL. No DB required.

0 0 2023-12-14

starrocks-kubernetes-operator

Kubernetes Operator for StarRocks

0 0 2025-04-09

starrocks-connector-for-apache-spark

0 0 2024-10-17

starrocks-connector-for-apache-flink

0 0 2025-02-19

starrocks

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.

0 0 2025-06-04

spring-rs

🍃 spring-rs is an application framework written in Rust, inspired by Java's Spring Boot

0 0 2024-09-20

spring-ai-alibaba

Agentic AI Framework for Java Developers

0 0 2025-06-25

speakr

Speakr is a personal, self-hosted web application designed for transcribing audio recordings

0 0 2025-06-24

spark-reference

0 0 2024-09-09

spark-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

0 0 2024-08-08

spark-kubernetes-operator

Apache Spark Kubernetes Operator

0 0 2025-07-04

spark

Apache Spark - A unified analytics engine for large-scale data processing

0 0 2025-07-06

slick

Slick (Scala Language Integrated Connection Kit) is a modern database query and access library for Scala

0 0 2024-03-15

slashbase

Modern database IDE for your dev & data workflows. Supports MySQL, PostgreSQL & MongoDB.

0 0 2024-11-14

simple-java-maven-app

For an introductory tutorial on how to use Jenkins to build a simple Java application with Maven.

0 0 2023-02-01

shenyu

Apache ShenYu is a Java native API Gateway for service proxy, protocol conversion and API governance.

0 0 2023-09-19

Sentinel

A powerful flow control component enabling reliability, resilience and monitoring for microservices. (A high-availability flow control and protection component for cloud-native microservices)

0 0 2024-05-08

secretflow

A unified framework for privacy-preserving data analysis and machine learning

0 0 2024-09-27

seatunnel-web

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

0 0 2024-10-17

seatunnel

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

0 0 2025-07-13

scheduler-plugins

Repository for out-of-tree scheduler plugins based on scheduler framework.

0 0 2025-07-08

scala-tutorials

0 0 2023-06-30

scala-reference

instructions for Scala use

0 0 2023-11-12

scala-demo

scala simple demo

0 0 2023-01-14

sbt-native-packager

sbt Native Packager

0 0 2023-02-10

RSS-Can

📰 🥫 Use RSS CAN be better and simple.

0 0 2022-12-26

rosedb

🚀 A high performance NoSQL database based on bitcask, supports string, list, hash, set, and sorted set.

0 0 2023-01-31

rnacos

Nacos server re-implemented in Rust.

0 0 2024-04-29

risingwave

RisingWave: A Distributed SQL Database for Stream Processing

0 0 2024-04-29

risinglight

An educational OLAP database system.

0 0 2023-12-04

redis-reference

0 0 2023-10-20

redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

0 0 2023-12-04

reactive-streams-jvm

Reactive Streams Specification for the JVM

0 0 2023-09-22

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

0 0 2024-12-30

qwerty-learner

Word-learning and English muscle-memory training software designed for keyboard workers

0 0 2023-08-03

qv

Quickly view your data

0 0 2024-09-24

PowerJob

Enterprise job scheduling middleware with distributed computing ability.

0 0 2024-05-10

pgcapture

A scalable Netflix DBLog implementation for PostgreSQL

0 0 2023-04-22

pentaho-kettle

Pentaho Data Integration ( ETL ) a.k.a Kettle

0 0 2024-06-21

pekko-samples

Apache Pekko Sample Projects

0 0 2025-01-14

pekko-quartz-scheduler

Quartz Extension and utilities for cron-style scheduling in Apache Pekko

0 0 2024-09-20

pekko-connectors

Apache Pekko Connectors is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Apache Pekko.

0 0 2025-01-14

paper_reading_cn

0 0 2024-02-02

OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

0 0 2025-05-29

OpenDerisk

AI-Native Risk Intelligence Systems. OpenDeRisk, your intelligent application-system risk manager, provides 24/7 comprehensive and in-depth protection.

0 0 2025-07-01

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

0 0 2024-10-31

nutsdb

A simple, fast, embeddable, persistent key/value store written in pure Go. It supports fully serializable transactions and many data structures such as list, set, sorted set.

0 0 2023-06-17

NorthWestFiveDog

The Northwest Five Dogs' journey of grinding LeetCode problems

0 0 2023-04-08

night

Weekly Go Online Meetup via Bilibili | Go Night Reading (Go 夜读) | Shares Go-related technical topics through live streams on Bilibili; members discuss programming topics daily on WeChat, Telegram, and Slack.

0 0 2024-06-17

netty

Netty project - an event-driven asynchronous network application framework

0 0 2024-01-25

navidrome

🎧☁️ Modern Music Server and Streamer compatible with Subsonic/Airsonic

0 0 2023-06-17

nacos

an easy-to-use dynamic service discovery, configuration and service management platform for building cloud native applications.

0 0 2024-02-21

mysql-binlog-stream

0 0 2024-06-25

mysql-binlog-connector-java

MySQL Binary Log connector

0 0 2024-04-24

miniob

MiniOB is a mini database that helps developers learn how databases work.

0 0 2023-09-16

MinimumViableDataspace

Guidance on documentation, scripts and integration steps on using the EDC project results

0 0 2024-11-21

mini-lsm

A tutorial of building an LSM-Tree storage engine in a week! (WIP)

0 0 2023-04-21

mill

Your shiny new Java/Scala build tool!

0 0 2024-11-21

metabase

The simplest, fastest way to get business intelligence and analytics to everyone in your company

0 0 2023-12-04

mdBook

Create books from Markdown files. Like Gitbook but implemented in Rust

0 0 2023-08-11

maxwell

Maxwell's daemon, a mysql-to-json kafka producer

0 0 2023-06-08

magic-gui

0 0 2023-09-03

localsend

An open-source cross-platform alternative to AirDrop

0 0 2024-02-23

linux-dash-zh

A beautiful, simple, web-based Linux server monitoring dashboard

0 0 2023-04-22

linux-0.11

The source code of linux-0.11, for studying the Linux kernel

0 0 2023-11-11

linkis

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

0 0 2024-07-20

leveldb

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

0 0 2022-12-27

LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation. (Presents the thinking behind solving LeetCode problems as animations)

0 0 2019-02-25

kyuubi

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

0 0 2025-03-28

kubesphere

The container platform tailored for Kubernetes multi-cloud, datacenter, and edge management ⎈ 🖥 ☁️

0 0 2023-11-16

kubernetes-client

Java client for Kubernetes & OpenShift

0 0 2024-01-11

KipSQL

Build the SQL layer of the KipDB database

0 0 2023-09-17

kafka

Mirror of Apache Kafka

0 0 2024-06-21

jmolecules

Libraries to help developers express architectural abstractions in Java code

0 0 2023-08-05

java-operator-sdk

Java SDK for building Kubernetes Operators

0 0 2024-05-14

intellij-magic

intellij platform plugin

0 0 2025-03-05

inlong

Apache InLong - a one-stop integration framework for massive data

0 0 2022-09-21

incubator-pekko

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

0 0 2025-01-14

incubator-paimon

Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.

0 0 2024-09-04

incubator-opendal

Apache OpenDAL: access data freely.

0 0 2023-10-24

incubator-celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

0 0 2024-02-05

ilogtail

Fast and Lightweight Observability Data Collector

0 0 2024-01-05

how-query-engines-work

This is the companion repository for the book How Query Engines Work.

0 0 2024-03-08

Home-Network-Note

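🚧 Continuously updated 🚧 Notes on building a home network environment for both study and entertainment, plus small lessons learned from tinkering with software and hardware.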

0 0 2022-11-12

HikariCP

光 HikariCP・A solid, high-performance, JDBC connection pool at last.

0 0 2024-09-29

headscale

An open source, self-hosted implementation of the Tailscale control server

0 0 2024-01-10

ha_xiaomi_home

Xiaomi Home Integration for Home Assistant

0 0 2024-12-23

hazelcast

Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.

0 0 2024-07-04

guava

Google core libraries for Java

0 0 2020-04-19

graduation

0 0 2020-03-08

gpt_academic

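Provides a graphical interactive interface for ChatGPT/GLM, specially optimized for paper reading, polishing, and writing. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel queries to multiple LLMs; support for local models such as Tsinghua's chatglm2. Compatible with Fudan MOSS, llama, rwkv, newbing, claude, claude2, etc.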

0 0 2023-07-29

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

0 0 2025-04-22

golang-playground

Play Golang in Web by Docker!

0 0 2022-09-06

godel-scheduler

A unified scheduler for online and offline tasks

0 0 2024-02-09

godel-rescheduler

0 0 2025-04-18

gluten

Gluten: Plugin to Double SparkSQL's Performance

0 0 2024-03-07

fyne

Cross platform GUI toolkit in Go inspired by Material Design

0 0 2024-01-23

FXGL

Java / JavaFX / Kotlin Game Library (Engine)

0 0 2024-02-02

Front-end-articles

Sharing my programming experience and what I have learned; click Watch to subscribe.

0 0 2022-04-19

flink-magic

Personal demo for learning Flink

0 0 2023-06-07

flink-learning

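Flink learning blog. http://www.54tianzhisheng.cn Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, as well as large-scale production case studies (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine Flink: Practice and Performance Optimization" is welcome.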

0 0 2022-11-12

flink-kubernetes-operator

Apache Flink Kubernetes Operator

0 0 2024-06-25

flink-cdc-connectors

CDC Connectors for Apache Flink®

0 0 2025-07-05

flink

Apache Flink

0 0 2024-09-30

FlagBoot

0 0 2024-06-25

feldera

The Feldera Incremental Computation Engine

0 0 2024-10-15

fd

A simple, fast and user-friendly alternative to 'find'

0 0 2024-09-01

etcd

Distributed reliable key-value store for the most critical data of a distributed system

0 0 2024-06-17

ebook2audiobookXTTS

Generates an audiobook with chapters and ebook metadata using Calibre and Xtts from Coqui tts, and with optional voice cloning, and supports multiple languages

0 0 2024-10-09

dtm

A distributed transaction framework, supports workflow, saga, tcc, xa, 2-phase message, outbox patterns, supports many languages.

0 0 2024-06-17

doris-streamloader

Stream Loader for Apache Doris

0 0 2024-03-14

doris-spark-connector

Spark Connector for Apache Doris

0 0 2024-09-04

doris-operator-k8s

An operator for Apache Doris that manages Doris cluster and observability components through Kubernetes CRs 😆

0 0 2024-02-01

doris-operator

Doris kubernetes operator

0 0 2024-09-10

doris-flink-connector

Flink Connector for Apache Doris

0 0 2024-03-08

doris

Apache Doris is an easy-to-use, high performance and unified analytics database.

0 0 2025-05-12

dolphinscheduler

Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.

0 0 2024-11-18

docker-flare

Flare ✨ Lightweight, high performance and fast self-hosted navigation pages, resource utilization rate is <1% CPU, MEM <30 M, Docker Image < 10M

0 0 2024-11-21

dm

Data Migration Platform

0 0 2023-06-08

dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

0 0 2024-08-22

digital_video_introduction

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸

0 0 2024-02-02

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

0 0 2024-09-11

devbox

Instant, easy, and predictable development environments

0 0 2023-11-26

deepwiki-open

Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme

0 0 2025-06-06

deep-research

My own open source implementation of OpenAI's new Deep Research agent. Get the same capability without paying $200. You can even tweak the behavior of the agent with adjustable breadth and depth. Run it for 5 min or 5 hours, it'll auto adjust.

0 0 2025-02-10

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

0 0 2024-04-24

ddia-references

Literature references for “Designing Data-Intensive Applications”

0 0 2024-04-22

dbeaver

Free universal database tool and SQL client

0 0 2024-03-18

db-readings

Readings in Databases

0 0 2023-09-28

DataX

DataX is the open source version of Alibaba Cloud DataWorks Data Integration.

0 0 2024-01-19

datafusion-remote-table

A DataFusion table provider for executing SQL queries on remote databases.

0 0 2025-05-15

datafusion-comet

Apache DataFusion Comet Spark Accelerator

0 0 2025-03-26

dataease

An open source data visualization and analysis tool that anyone can use.

0 0 2023-12-07

data-learning

0 0 2022-11-12

cursor

An editor built for programming with AI 🤖

0 0 2023-03-25

cube-studio

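cube studio is an open source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. It supports SSO login, multi-tenancy, big data platform integration, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, VGPU inference serving, edge computing, serverless, a labeling platform, automated labeling, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge bases, and an AI model app store. It also supports one-click model development/inference/fine-tuning, Chinese domestic CPU/GPU/NPU chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano.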

0 0 2024-12-17

cube

📊 Cube — The Semantic Layer for Building Data Applications

0 0 2024-09-20

cs-self-learning

A self-learning guide to computer science

0 0 2023-07-29

cron4s

Cross-platform CRON expression parsing for Scala

0 0 2024-01-22

coral

Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.

0 0 2023-09-11

continue

⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains

0 0 2024-09-18

colima

Container runtimes on macOS (and Linux) with minimal setup

0 0 2024-11-06

CMAK

CMAK is a tool for managing Apache Kafka clusters

0 0 2024-03-07

CloudShuffleService

Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.

0 0 2025-01-16

client-java

TiKV Java Client

0 0 2023-08-07

chunjun

A data integration framework

0 0 2024-09-29

chinese-poetry

The most comprehensive database of Chinese poetry 🧶 Nearly 14,000 poets from the Tang and Song dynasties, close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci by 1,564 ci poets of the Song period.

0 0 2025-05-14

chatgpt-web

A ChatGPT demo web page built with Express and Vue3

0 0 2023-04-12

certs-maker

Tiny self-signed tool, file size between 1.5MB(binary) and 4MB (docker). Generate a self-hosted / dev certificate through configuration.

0 0 2024-01-12

canal

Alibaba's MySQL binlog incremental subscription and consumption component

0 0 2023-06-08

calcite

Apache Calcite

0 0 2025-01-28

blog-comments

0 0 2023-07-20

blaze

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

0 0 2025-07-06

BigData-Notes

A beginner's guide to big data

0 0 2020-02-15

ballista-mvp

An MVP implementation of a distributed query engine, cut from the datafusion-ballista codebase for learning purposes.

0 0 2025-01-23

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

0 0 2023-11-28

arrow-datafusion

Apache Arrow DataFusion SQL Query Engine

0 0 2024-09-24

arrow-ballista

Apache Arrow Ballista Distributed Query Engine

0 0 2025-01-23

arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

0 0 2023-12-13

arkflow

High-performance Rust stream processing engine, providing powerful data stream processing capabilities, supporting multiple input/output sources and processors.

0 0 2025-03-30

ape-dts

Ape Data Transfer Suite, written in Rust. Provides ultra-fast data replication between MySQL, PostgreSQL, Redis, MongoDB, Kafka and ClickHouse, ideal for disaster recovery (DR) and migration scenarios.

0 0 2025-04-24

akka-samples

Akka Sample Projects

0 0 2023-03-21

akka-playground-reference

Examples for Typed-Akka

0 0 2023-03-21

akka-playground

Examples for Typed-Akka

0 0 2023-03-21

akka-http

The Streaming-first HTTP server/module of Akka

0 0 2023-10-27

akka-guide

🌴 A Chinese guide to Akka, based on Java.

0 0 2023-01-12

akka

Build highly concurrent, distributed, and resilient message-driven applications on the JVM

0 0 2023-07-01

A-Programmers-Guide-to-English

An English learning guide written for programmers, v1.2. For the online version, click ->

0 0 2019-01-26