Model Configuration

Model configuration describes which foundation model you fine-tune, how you adapt it, and how you deploy it. LLMTune supports 17+ model families across all major modalities.

Supported Model Families

LLMTune syncs with the IO.net model inventory and supports:
  • LLaMA – LLaMA 3.3, LLaMA 3.4, and variants (Meta)
  • Mistral – Mistral Nemo, Mistral 7B, and variants
  • Qwen – Qwen3, Qwen-VL, Qwen2-Audio, Qwen2-VL-Video, and variants
  • DeepSeek – DeepSeek R1, DeepSeek-Coder, and variants
  • And more – 17+ model families total
Each model entry in LLMTune Models includes:
  • Provider information
  • Context length
  • Parameter count
  • Latency expectations
  • Recommended use cases
  • Deployment notes
  • Evaluation metrics
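
For orientation, a catalog entry can be pictured as a small record like the sketch below. The field names and values are illustrative placeholders, not the actual LLMTune Models schema.

```python
# Hypothetical shape of a catalog entry; the keys mirror the fields listed
# above, but the exact schema and values are assumptions.
model_entry = {
    "name": "llama-3.3-70b-instruct",        # illustrative identifier
    "provider": "Meta",                       # provider information
    "context_length": 128_000,                # tokens
    "parameter_count": "70B",
    "latency": "medium",                      # expected latency class
    "recommended_use_cases": ["chat", "summarization"],
    "deployment_notes": "Multi-GPU recommended for full-precision serving.",
    "evaluation_metrics": ["accuracy", "latency_p95"],
}
```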

Model Selection

Browse the Catalog

  1. Navigate to LLMTune Models from the main navigation.
  2. Browse the curated catalog of production-ready models.
  3. Use filters to find models by:
    • Provider
    • Modality (text, vision, audio, code, etc.)
    • Size (parameter count)
    • Use case
  4. Compare models side-by-side to find the best fit.
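
If you work from a catalog export instead of the UI, the same filters can be applied in a few lines of Python. The sketch below is a minimal, self-contained illustration; the entries and field names are assumptions, not LLMTune's data format.

```python
# Hypothetical filtering of a catalog export by the same facets the UI
# exposes (provider, modality, size); entries and keys are assumptions.
catalog = [
    {"name": "llama-3.3-70b", "provider": "Meta", "modality": "text", "params_b": 70},
    {"name": "qwen2-vl-7b", "provider": "Alibaba", "modality": "vision", "params_b": 7},
    {"name": "mistral-nemo-12b", "provider": "Mistral", "modality": "text", "params_b": 12},
]

def filter_models(entries, provider=None, modality=None, max_params_b=None):
    """Return entries matching the selected facets (None means 'any')."""
    return [
        m for m in entries
        if (provider is None or m["provider"] == provider)
        and (modality is None or m["modality"] == modality)
        and (max_params_b is None or m["params_b"] <= max_params_b)
    ]

print(filter_models(catalog, modality="text", max_params_b=20))
# -> only the Mistral Nemo entry in this toy example
```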

Model Comparison

  • Side-by-side comparison – See multiple models at once
  • Performance metrics – Compare latency, quality, and cost
  • Usage guidance – Read recommendations for each model
  • Deployment notes – Review deployment best practices

Adaptation Strategies

FineTune Studio supports multiple training methods:

Parameter-Efficient Methods

  • LoRA / QLoRA – Fast iteration, lower compute costs
  • SFT – Supervised fine-tuning with labeled data

Full Fine-Tuning

  • Full fine-tune – Available for select models when deeper changes are required
  • PPO / RLAIF – Reinforcement learning methods

Specialized Methods

  • Code Generation – Optimized for code tasks
  • Multimodal – Vision-language model training
  • Audio methods – Audio understanding, ASR, TTS
  • Embeddings – Text-to-embeddings training
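
The chosen method is typically captured alongside the rest of the training configuration. The sketch below shows one plausible way to express that choice; the keys and accepted values are assumptions, not the actual FineTune Studio schema.

```python
# Hypothetical training-method selection; keys and values are illustrative.
training_config = {
    "method": "qlora",            # e.g. "lora", "qlora", "sft", "full", "ppo", "rlaif"
    "base_model": "llama-3.3-70b-instruct",
    "lora": {                     # only relevant when a LoRA-family method is chosen
        "rank": 16,
        "alpha": 32,
        "dropout": 0.05,
        "target_modules": ["q_proj", "v_proj"],
    },
}
```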

Training Configuration

Hyperparameters

Key settings you can configure:
  • Learning rate – Default varies by method; typically 0.0001 for SFT
  • Batch size – Adaptive based on model size and dataset
  • Epochs – Typically 2–5 for most methods
  • Evaluation cadence – Frequency of validation runs during training
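
Put together, these settings might be expressed as a small configuration block like the following. The key names are assumptions; the defaults mirror the list above.

```python
# Hypothetical hyperparameter block; key names are assumed, values follow
# the typical defaults described above.
hyperparameters = {
    "learning_rate": 1e-4,      # typical SFT default; varies by method
    "batch_size": "auto",       # adaptive based on model size and dataset
    "epochs": 3,                # typically 2-5 for most methods
    "eval_every_n_steps": 500,  # evaluation cadence during training
}
```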

Compute Options

Choose your compute model:
  • Traditional Computing – Single location, predictable performance
    • Single Instance or GPU Cluster
  • Federated Computing – Distributed across global nodes
    • Privacy-preserving, unlimited scale, lower costs
    • Single Instance or GPU Cluster
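
A compute selection could be recorded alongside the training configuration, for example as in the sketch below. The option names follow the choices above, but the exact schema and values are assumptions.

```python
# Hypothetical compute selection; field names and values are illustrative.
compute = {
    "mode": "federated",          # or "traditional"
    "topology": "gpu_cluster",    # or "single_instance"
    "num_gpus": 8,                # illustrative value
}
```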

Deployment Configuration

When deploying a fine-tuned model:

Version Control

  • Version tagging – Tag each deployment with semantic versions
  • Change logs – Document what changed in each version
  • Approval workflows – Require approvals before promotion
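
As an illustration, a versioned deployment record might look like the following. The structure and field names are assumptions, not LLMTune's deployment API.

```python
# Hypothetical deployment record with semantic version tagging, a change
# log entry, and an approval gate; the shape is an assumption.
deployment = {
    "model": "support-bot",
    "version": "1.3.0",                        # semantic version tag
    "changelog": "Retrained on new support tickets; improved refusal handling.",
    "requires_approval": True,                 # gate promotion behind review
}
```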

Traffic Management

  • Canary deployments – Gradually shift traffic to new versions
  • Shadow deployments – Test without affecting production
  • Blue/Green – Instant switch between versions
  • Traffic splitting – Route percentage of traffic
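
For example, a canary rollout can be described as a sequence of traffic weights that gradually shift load to the new version. The strategy names and keys in this sketch are assumptions.

```python
# Hypothetical canary rollout: send a small slice of traffic to the
# candidate version and grow it over time. Keys and values are assumed.
traffic_policy = {
    "strategy": "canary",          # or "shadow", "blue_green", "split"
    "stable_version": "1.2.4",
    "candidate_version": "1.3.0",
    "steps": [                     # percentage of traffic sent to the candidate
        {"weight": 5,  "hold_minutes": 30},
        {"weight": 25, "hold_minutes": 60},
        {"weight": 100},           # full cutover once earlier steps look healthy
    ],
}
```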

Autoscaling

  • Min/max replicas – Configure scaling bounds
  • Timeout settings – Set request timeouts
  • Resource allocation – Configure CPU/memory limits
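
These bounds might be captured in a block like the following; the field names are illustrative assumptions.

```python
# Hypothetical autoscaling and resource settings; names are illustrative.
autoscaling = {
    "min_replicas": 1,
    "max_replicas": 8,
    "request_timeout_s": 30,
    "resources": {"cpu": "4", "memory": "32Gi", "gpu": 1},
}
```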

Deployment Metadata

Deployed models store comprehensive metadata:
  • Base model identifier – Original foundation model
  • Dataset references – Which datasets were used
  • Training configuration – Hyperparameters and method
  • Training job ID – Link back to training run
  • Endpoint URL – API endpoint for inference
  • Current status – Active, paused, retired
  • Version history – All previous versions
Use this metadata to:
  • Reproduce runs – Recreate training configurations
  • Audit model lineage – Track model evolution
  • Debug issues – Understand what changed between versions
  • Meet compliance requirements – Document model provenance
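
As a sketch of how that metadata supports reproduction and auditing, the example below rebuilds a training configuration from a stored record. The record shape, identifiers, and helper function are assumptions, and the placeholder values are not real.

```python
# Hypothetical deployment metadata record and a helper that reconstructs
# the inputs needed to re-run training; everything here is illustrative.
deployment_metadata = {
    "base_model": "llama-3.3-70b-instruct",
    "datasets": ["support-tickets-v4"],
    "training_config": {"method": "qlora", "learning_rate": 1e-4, "epochs": 3},
    "training_job_id": "job-000000",          # placeholder identifier
    "endpoint_url": "https://example.invalid/v1/models/support-bot",
    "status": "active",
    "version_history": ["1.2.4", "1.3.0"],
}

def reproduce_config(metadata):
    """Rebuild a training configuration from a stored deployment record."""
    return {
        "base_model": metadata["base_model"],
        "datasets": metadata["datasets"],
        **metadata["training_config"],
    }

print(reproduce_config(deployment_metadata))
```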

Best Practices

  1. Start small – Test with smaller models before scaling up
  2. Use playground datasets – Validate your approach quickly
  3. Monitor training – Watch metrics in real-time
  4. Evaluate before deploying – Use LLMTune Evaluate to test models
  5. Version everything – Tag and document all deployments
  6. Plan rollbacks – Know which version to roll back to

Next Steps