pythonadvanced
Fine-Tune Embeddings with SetFit
Fine-tune a sentence embedding model on a small labelled dataset using the SetFit framework.
pythonPress ⌘/Ctrl + Shift + C to copy
from setfit import SetFitModel, SetFitTrainer
from datasets import Dataset
texts = ['great product', 'loved it', 'terrible quality', 'awful experience', 'excellent service', 'very bad', 'highly recommend', 'waste of money']
labels = [1, 1, 0, 0, 1, 0, 1, 0]
dataset = Dataset.from_dict({'text': texts, 'label': labels})
model = SetFitModel.from_pretrained('sentence-transformers/all-MiniLM-L6-v2', labels=['negative','positive'])
trainer = SetFitTrainer(
model=model,
train_dataset=dataset,
eval_dataset=dataset,
metric='accuracy',
num_iterations=20,
)
trainer.train()
metrics = trainer.evaluate()
print('Accuracy:', metrics['accuracy'])
predictions = model.predict(['amazing!', 'complete disaster'])
print('Predictions:', predictions)Use Cases
- few-shot classification
- custom embeddings
- domain adaptation
Tags
Related Snippets
Similar patterns you can reuse in the same workflow.
typescriptbeginner
Generate Text Embeddings with OpenAI
Create vector embeddings for semantic search and similarity matching using text-embedding-3-small.
Best for: semantic search
#openai#embeddings
typescriptadvanced
RAG Pipeline (Retrieve + Augment + Generate)
Minimal RAG implementation: embed a query, retrieve top-k chunks, inject into prompt.
Best for: document Q&A
#rag#embeddings
typescriptintermediate
Pinecone Vector Store Operations
Store and query vector embeddings with Pinecone for semantic search and similarity matching.
Best for: Semantic search engines
#pinecone#vector-store
typescriptbeginner
Few-Shot Prompt Template
Build structured few-shot prompts with examples, system instructions, and output format constraints.
Best for: Consistent AI outputs
#prompts#few-shot