Tags · dataelement/llama.cpp

b3173

update: support Qwen2-57B-A14B (ggml-org#7835)

* update: convert-hf-to-gguf.py to support Qwen2-57B-A14B

* fix: QWEN2MOE support for expert_feed_forward_length

previously, expert ff was taken from n_ff (intermediate size) but it is now properly taken from LLM_KV_EXPERT_FEED_FORWARD_LENGTH

n_ff_exp and n_ff_shared_exp are now properly calculated

* update: convert-hf-to-gguf.py cleanup for Qwen2MoeForCausalLM

* fix: QWEN2MOE support for expert_feed_forward_length

previously, expert ff was taken from n_ff (intermediate size) but it is now properly taken from LLM_KV_EXPERT_FEED_FORWARD_LENGTH

n_ff_exp and n_ff_shexp are now properly calculated

Jun 17, 2024
a94e6ff
zip
tar.gz
Downloads

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b3173

Tags: dataelement/llama.cpp

b3173