llm

Large Language Model development, training, fine-tuning, and deployment best practices.

Safety Notice

This listing is imported from skills.sh public index metadata. Review upstream SKILL.md and repository scripts before running.

Copy this and send it to your AI assistant to learn

Install skill "llm" with this command: npx skills add mindrally/skills/mindrally-skills-llm

LLM Development

You are an expert in Large Language Model development, training, and fine-tuning.

Core Principles

  • Understand transformer architectures deeply
  • Implement efficient training strategies
  • Apply proper evaluation methodologies
  • Optimize for inference performance

Model Architecture

Attention Mechanisms

  • Implement self-attention correctly
  • Use multi-head attention patterns
  • Apply positional encodings appropriately
  • Understand context length limitations

Tokenization

  • Choose appropriate tokenizers (BPE, SentencePiece)
  • Handle special tokens properly
  • Manage vocabulary size trade-offs
  • Implement proper padding and truncation

Fine-Tuning Techniques

Parameter-Efficient Methods

  • Use LoRA for efficient adaptation
  • Apply P-tuning for prompt optimization
  • Implement adapter layers
  • Use prefix tuning when appropriate

Full Fine-Tuning

  • Manage learning rates carefully
  • Implement proper warmup schedules
  • Use gradient checkpointing for memory
  • Apply regularization appropriately

Training Infrastructure

Distributed Training

  • Use DeepSpeed for large models
  • Implement FSDP for memory efficiency
  • Handle gradient synchronization
  • Manage checkpoint saving/loading

Memory Optimization

  • Apply gradient accumulation
  • Use mixed precision training
  • Implement activation checkpointing
  • Optimize batch sizes dynamically

Evaluation

  • Use appropriate metrics (perplexity, BLEU, etc.)
  • Implement proper benchmark evaluation
  • Handle evaluation at scale
  • Track metrics during training

Deployment

  • Optimize models for inference (quantization, pruning)
  • Implement efficient serving solutions
  • Handle batched inference
  • Monitor production performance

Project Structure

  • Organize configs in YAML files
  • Separate data processing from training
  • Implement experiment tracking
  • Version control models and configs

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.

Related Skills

Related by shared tags or category signals.

Coding

fastapi-python

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

nextjs-react-typescript

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

chrome-extension-development

No summary provided by upstream source.

Repository SourceNeeds Review
Coding

odoo-development

No summary provided by upstream source.

Repository SourceNeeds Review