Skip to content
View czczup's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@OpenGVLab
Block or Report

Block or report czczup

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning

Python 45 1 Updated Jun 16, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

149 2 Updated Jun 16, 2024

This is the official implementation of the paper "Needle In A Multimodal Haystack"

Python 42 4 Updated Jun 20, 2024
TypeScript 113 8 Updated Jun 21, 2024

Parameter-Inverted Image Pyramid Networks (PIIP)

Python 39 2 Updated Jun 25, 2024

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

269 8 Updated Jun 18, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,126 250 Updated Jun 26, 2024
Python 14 Updated Jun 1, 2024
Python 71 3 Updated Jun 9, 2024
HTML 2 Updated May 31, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 859 55 Updated Jun 21, 2024

PsyDI: A MBTI agent that helps you understand your personality type through a relaxed multi-modal interaction.

TypeScript 52 1 Updated Jun 22, 2024

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…

Python 401 28 Updated Apr 21, 2024
Python 7 Updated May 27, 2024

MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)

Python 97 3 Updated May 27, 2024

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列

Python 1,030 92 Updated Jun 13, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,092 331 Updated Jun 25, 2024

ms-swift: Use PEFT or Full-parameter to finetune 250+ LLMs or 35+ MLLMs. (Qwen2, GLM4, Internlm2, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Phi3-Vision, ...)

Python 2,085 202 Updated Jun 26, 2024

An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.

Python 79 6 Updated Jun 19, 2024

LLM inference in C/C++

C++ 60,790 8,677 Updated Jun 26, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 10,833 968 Updated Jun 25, 2024

ICML'2024 | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI

Python 63 2 Updated Jun 25, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Python 3,735 287 Updated Jun 20, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,017 52 Updated Jun 25, 2024

[ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments

Python 16 1 Updated Mar 21, 2024

Data and benchmark code for the EgoExoLearn dataset

Python 34 Updated May 30, 2024

Open source implementation and models of One-step Diffusion with Distribution Matching Distillation

Python 93 11 Updated May 26, 2024

Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning

Python 345 18 Updated May 17, 2024

The suite of modeling video with Mamba

Python 171 17 Updated May 14, 2024
Next