Skip to content

Pull requests: huggingface/text-generation-inference

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

corrected Pydantic warning.
#2095 opened Jun 20, 2024 by yukiman76 Loading…
1 task
feat: add simple tests for weights
#2092 opened Jun 19, 2024 by drbh Loading…
CI Login to dockerhub
#2086 opened Jun 18, 2024 by glegendre01 Draft
5 tasks
Support exl2-quantized Qwen2 models
#2085 opened Jun 18, 2024 by danieldk Loading…
5 tasks
Fix missing rope scaling option for YaRN
#2078 opened Jun 17, 2024 by calycekr Loading…
5 tasks
Add OTLP Service Name Environment Variable
#2076 opened Jun 17, 2024 by KevinDuffy94 Loading…
3 of 5 tasks
Support HF_TOKEN environment variable
#2066 opened Jun 13, 2024 by Wauplin Loading…
Add support for Docker Compose
#2063 opened Jun 12, 2024 by StefanDanielSchwarz Loading…
1 of 5 tasks
fix: set sharded true if WORLD_SIZE is set
#2062 opened Jun 12, 2024 by drbh Loading…
Factor out sharding of packed tensors
#2059 opened Jun 12, 2024 by danieldk Loading…
1 of 5 tasks
use xpu-smi to dump used memory
#2047 opened Jun 11, 2024 by sywangyi Loading…
5 tasks
Enabling CI for AMD with new runner..
#2034 opened Jun 6, 2024 by Narsil Loading…
5 tasks
feat: re-allocate pages dynamically
#2024 opened Jun 5, 2024 by OlivierDehaene Loading…
Enable multiple LoRa adapters
#2010 opened Jun 4, 2024 by drbh Loading…
Cpu tgi
#1936 opened May 23, 2024 by sywangyi Loading…
5 tasks
ProTip! What’s not been updated in a month: updated:<2024-05-19.