Skip to content

docs: add LoRA adapter documentation for LLMInferenceService#654

Open
andresllh wants to merge 2 commits into
kserve:mainfrom
andresllh:docs/add-lora-adapter-documentation
Open

docs: add LoRA adapter documentation for LLMInferenceService#654
andresllh wants to merge 2 commits into
kserve:mainfrom
andresllh:docs/add-lora-adapter-documentation

Conversation

@andresllh
Copy link
Copy Markdown

Add comprehensive documentation for LoRA (Low-Rank Adaptation) adapter support in LLMInferenceService, which was implemented in kserve/kserve#5317.

Changes:

  • New page: docs/model-serving/generative-inference/llmisvc/lora-adapters.md

    • Overview of LoRA adapters and benefits
    • Supported URI schemes (hf://, s3://, pvc://)
    • Configuration examples and field reference
    • Usage at inference time with OpenAI-compatible API
    • Monitoring and troubleshooting guide
  • Updated llmisvc-configuration.md with LoRA adapter configuration section

  • Updated llmisvc-overview.md with expanded LoRA mention and link

  • Updated sidebars.ts with new navigation entry

Related:

"Fixes #issue-number" or "Add description of the problem this PR solves"

Proposed Changes

@netlify
Copy link
Copy Markdown

netlify Bot commented Apr 10, 2026

Deploy Preview for elastic-nobel-0aef7a ready!

Name Link
🔨 Latest commit 70e139c
🔍 Latest deploy log https://app.netlify.com/projects/elastic-nobel-0aef7a/deploys/69e62318de023d00092da82d
😎 Deploy Preview https://deploy-preview-654--elastic-nobel-0aef7a.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

Add comprehensive documentation for LoRA (Low-Rank Adaptation) adapter
support in LLMInferenceService, which was implemented in kserve/kserve#5317.

Changes:
- New page: docs/model-serving/generative-inference/llmisvc/lora-adapters.md
  - Overview of LoRA adapters and benefits
  - Supported URI schemes (hf://, s3://, pvc://)
  - Configuration examples and field reference
  - Usage at inference time with OpenAI-compatible API
  - Monitoring and troubleshooting guide

- Updated llmisvc-configuration.md with LoRA adapter configuration section
- Updated llmisvc-overview.md with expanded LoRA mention and link
- Updated sidebars.ts with new navigation entry

Related:
- Implementation PR: kserve/kserve#5317
- Sample docs: https://github.com/kserve/kserve/tree/master/docs/samples/llmisvc/lora-adapters

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Signed-off-by: Andres Llausas <allausas@redhat.com>
@andresllh andresllh force-pushed the docs/add-lora-adapter-documentation branch from 70ac1ab to 3895e79 Compare April 10, 2026 15:04
Signed-off-by: Andres Llausas <allausas@redhat.com>
Comment on lines +130 to +150
**S3 Configuration with Credentials**:

```yaml
template:
containers:
- name: storage-initializer
env:
- name: AWS_ACCESS_KEY_ID
valueFrom:
secretKeyRef:
name: s3-config
key: AWS_ACCESS_KEY_ID
- name: AWS_SECRET_ACCESS_KEY
valueFrom:
secretKeyRef:
name: s3-config
key: AWS_SECRET_ACCESS_KEY
- name: S3_ENDPOINT
value: "https://minio.example.com"
- name: S3_USE_HTTPS
value: "1"
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does this actually work?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants