Skip to content
View simon-mo's full-sized avatar
Block or Report

Block or report simon-mo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 10,786 798 Updated May 23, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 21,880 3,083 Updated Jul 1, 2024

A multi-level tensor algebra superoptimizer

C++ 261 16 Updated Jun 29, 2024

MLX: An array framework for Apple silicon

C++ 15,625 889 Updated Jun 30, 2024

A library for building fast, reliable and evolvable network services.

Rust 20,221 1,097 Updated Jun 28, 2024

A user-friendly CMS for static site generators.

Vue 1,196 126 Updated Jun 19, 2024

A container runtime written in Rust

Rust 5,975 331 Updated Jul 1, 2024

A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data a…

TypeScript 2,124 85 Updated Jun 28, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 747 61 Updated Jun 30, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,450 955 Updated Jun 22, 2024

a unified scheduler for online and offline tasks

Go 375 58 Updated Jun 24, 2024

Build better UIs faster.

Python 7,768 298 Updated Jun 26, 2024

A SolidJS diagramming framework

TypeScript 72 4 Updated Jun 25, 2024

simple markdown editor w inline comments, on latest automerge stack

TypeScript 192 16 Updated Jun 28, 2024

Ensō is a high-performance streaming interface for NIC-application communication.

SystemVerilog 60 6 Updated Jun 28, 2024

Kepler (Kubernetes-based Efficient Power Level Exporter) uses eBPF to probe performance counters and other system stats, use ML models to estimate workload energy consumption based on these stats, …

Go 1,050 167 Updated Jun 27, 2024

a Rust library for parsing, validating, and modifying Dockerfiles

Rust 38 12 Updated Mar 5, 2024

Docker image identifier parser.

Go 14 5 Updated May 11, 2020

Module, Model, and Tensor Serialization/Deserialization

Python 146 24 Updated Jun 29, 2024

SQL databases in Python, designed for simplicity, compatibility, and robustness.

Python 13,514 611 Updated Jun 27, 2024

Large Language Model Text Generation Inference

Python 8,311 939 Updated Jun 30, 2024

LLM papers I'm reading, mostly on inference and model compression

680 29 Updated Dec 21, 2023

A fast, clean, responsive Hugo theme.

HTML 9,077 2,479 Updated Jun 26, 2024

A monitor of resources

C++ 17,969 578 Updated Jun 29, 2024

A lightweight quadtree implementation for javascript

JavaScript 590 116 Updated Nov 11, 2023

S3 Service Adapter

Rust 117 30 Updated Jun 9, 2024

Rust crates to extend containerd

Rust 165 60 Updated Jun 25, 2024

得意黑 Smiley Sans:一款在人文观感和几何特征中寻找平衡的中文黑体

HTML 12,819 365 Updated Mar 16, 2024

🔥 Blazing fast bulk data transfers between any cloud 🔥

Python 986 58 Updated May 11, 2024

Standalone Kubelet Tutorial

Go 207 13 Updated Oct 11, 2017
Next