issue/467 [BUG]: enforce safetensors index-based weight loading for robustness#469

Open
rubik-hua wants to merge 1 commit into InfiniTensor:main from rubik-hua:model.safetensors.index

@rubik-hua


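Replace the fragile glob with a lookup in model.safetensors.index.json so that weight files with non-standard names (e.g., Mistral-Large-Instruct) are handled. A glob fallback is included.

Most models include a model.safetensors.index.json file, which describes the weight files that make up the model.

Some models, however, ship weight files in more than one format, so we cannot simply load every safetensors file into device memory.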

image
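A minimal sketch of the lookup described above, not the code in this PR. It assumes the usual Hugging Face index layout, in which a top-level `weight_map` maps each tensor name to the shard file that stores it. The function name `resolve_weight_files` and the constant `INDEX_NAME` are hypothetical, chosen only for illustration.

```python
import json
from pathlib import Path

# Hypothetical constant: the index file name mentioned in this PR.
INDEX_NAME = "model.safetensors.index.json"


def resolve_weight_files(model_dir):
    """Return the sorted safetensors shard paths that should be loaded.

    Hypothetical helper: prefers the index file, falls back to a glob.
    """
    model_dir = Path(model_dir)
    index_path = model_dir / INDEX_NAME

    if index_path.is_file():
        with index_path.open(encoding="utf-8") as f:
            index = json.load(f)
        # "weight_map" maps each tensor name to the shard file that holds it;
        # many tensors share a shard, so deduplicate the file names.
        shard_names = sorted(set(index["weight_map"].values()))
        return [model_dir / name for name in shard_names]

    # Fallback: no index file, so take every *.safetensors file in the directory.
    return sorted(model_dir.glob("*.safetensors"))
```

With this approach, a directory that holds the same weights in a second set of safetensors files only has the shards named in the index loaded; the extra files are picked up only when the index is missing.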

Model verification: python examples/test_infer.py --device moore --model /home/rubik/models/Mistral-Large-Instruct-2411 --tp 8 --prompt 'introduce yourself'

image

@rubik-hua rubik-hua requested a review from a team June 30, 2026 08:54