Zhe Chen czczup

🎯

Focusing

Knowledge is infinite.

229 followers · 66 following

Nanjing University
Nanjing
https://czczup.github.io/

Stars

OpenGVLab / LCL

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Python · 45 stars · 1 fork · Updated Jun 16, 2024

OpenGVLab / OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

149 stars · 2 forks · Updated Jun 16, 2024

OpenGVLab / MM-NIAH

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Python · 42 stars · 4 forks · Updated Jun 20, 2024

opendatalab / LabelLLM

TypeScript · 113 stars · 8 forks · Updated Jun 21, 2024

OpenGVLab / PIIP

Parameter-Inverted Image Pyramid Networks (PIIP)

Python · 39 stars · 2 forks · Updated Jun 25, 2024

BradyFU / Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

269 stars · 8 forks · Updated Jun 18, 2024

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python · 3,126 stars · 250 forks · Updated Jun 26, 2024

yuecao0119 / MMFuser

Python · 14 stars · Updated Jun 1, 2024

alibaba / conv-llava

Python · 71 stars · 3 forks · Updated Jun 9, 2024

InternVL / InternVL.github.io

HTML · 2 stars · Updated May 31, 2024

Efficient-Large-Model / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python · 859 stars · 55 forks · Updated Jun 21, 2024

opendilab / PsyDI

PsyDI: An MBTI agent that helps you understand your personality type through a relaxed multi-modal interaction.

TypeScript · 52 stars · 1 fork · Updated Jun 22, 2024

OpenGVLab / Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…

Python · 401 stars · 28 forks · Updated Apr 21, 2024

zehanwang01 / FreeBind

Python · 7 stars · Updated May 27, 2024

mulanai / MuLan

MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)

Python · 97 stars · 3 forks · Updated May 27, 2024

OpenBMB / VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint), based on the CPM foundation models

Python · 1,030 stars · 92 forks · Updated Jun 13, 2024

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python · 3,092 stars · 331 forks · Updated Jun 25, 2024

modelscope / swift

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Phi3-Vision, ...)

Python · 2,085 stars · 202 forks · Updated Jun 26, 2024