Pull requests: scaleapi/llm-engine

Pull requests list

Add persistent external model cache via PVC
#822 opened May 6, 2026 by lukasewecker Collaborator
fix(model-engine): remediate Trivy vulnerability findings
#818 opened May 1, 2026 by scale-ballen Contributor
docs: SEV1 post-mortem for model-engine 500s incident (MLI-6574)
#817 opened Apr 29, 2026 by lilyz-ai Collaborator
2 tasks
proposal: temporal endpoint type for Temporal activity workers (MLI-6425)
#815 opened Apr 24, 2026 by lilyz-ai Collaborator
3 tasks
chore: bump model-engine image tag to latest main (2e9d0078)
#811 opened Apr 21, 2026 by lilyz-ai Collaborator
2 tasks
feat: support OpenAI 2 in Python client
#806 opened Apr 13, 2026 by scale-ballen Contributor
feat(vllm_batch): add reasoning_parser support for batch inference
#804 opened Apr 3, 2026 by lilyz-ai Collaborator
fix(gcp): surface GCP Secret Manager errors and validate DB credentials
#793 opened Mar 31, 2026 by lilyz-ai Collaborator
3 tasks
chore: bump model-engine image tag to 4de8ad99
#789 opened Mar 27, 2026 by lilyz-ai Collaborator
2 tasks
fix(vllm): accept --disable-log-requests as no-op for vLLM 0.17+ compat
#783 opened Mar 19, 2026 by lilyz-ai Collaborator
1 task done
fix: map disable_log_requests to --no-enable-log-requests for newer vLLM
#779 opened Mar 11, 2026 by lilyz-ai Collaborator
2 tasks
Feat: Update Readme to add how to publish model engine image
#766 opened Feb 24, 2026 by postevanus-scale Collaborator
feat: add ModelWeightsManager to auto-sync HF weights on endpoint creation
#761 opened Feb 20, 2026 by lilyz-ai Collaborator
6 tasks done
Inference: allow IPv4 bind host
#758 opened Feb 13, 2026 by dustinrubin5050
add qwen3-8b-instruct as supported model
#756 opened Feb 12, 2026 by lukasewecker Collaborator
cpu only endpoints
#751 opened Jan 31, 2026 by dmchoiboi Collaborator
Revert openai schema back to 3.0.0
#740 opened Dec 18, 2025 by lilyz-ai Collaborator
Share controller 20251206 133239
#736 opened Dec 6, 2025 by Shreyasg13
[MLI-4966] Launch support multiple routes passthrough
#722 opened Oct 14, 2025 by meher-m Contributor
Vllm batch upgrade
#715 opened Sep 24, 2025 by dmchoiboi Collaborator Draft