by deepsquirrelnet
5 days ago
If you want to go deeper on language models, try these project ideas:

- Zero-shot encoders like tasksource or GliNER
- Natural language inference: https://huggingface.co/blog/dleemiller/nli-xenc-ways-to-use
- GRPO training
- GEPA prompt tuning Qwen 0.6B (or GEPA, then GRPO)
- Use an embedding model and train a classifier (MLP, logistic, SVM)
- Use a larger LLM to generate a synthetic dataset (beware of lack of diversity; mine "seed text" from real sources first)
- Synthetically generate "hard examples" where more than one category may be valid, and DPO tune your preferred responses
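For the "embedding model plus classifier" idea, here is a rough sketch of the shape of that project, not a recommended implementation. To keep it dependency-free, `embed` is a hypothetical stand-in (a hashed bag of words) and the logistic classifier is hand-rolled; in a real project you would presumably swap `embed` for an actual embedding model and use a library classifier. The toy data, `DIM`, and all function names are made up for illustration.

```python
import hashlib
import math
import random
import re

DIM = 256  # size of the stand-in embedding (assumption, not a model's real size)


def embed(text):
    """Hypothetical stand-in for an embedding model: hashed bag of words,
    L2-normalised. Replace with a real sentence-embedding model."""
    vec = [0.0] * DIM
    for token in re.findall(r"[a-z0-9']+", text.lower()):
        idx = int(hashlib.md5(token.encode()).hexdigest(), 16) % DIM
        vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def predict_proba(w, b, x):
    """Probability of the positive class under a logistic model."""
    z = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(X, y, epochs=500, lr=1.0, seed=0):
    """Plain SGD on the logistic loss; the embeddings stay frozen."""
    rng = random.Random(seed)
    w, b = [0.0] * DIM, 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            grad = predict_proba(w, b, X[i]) - y[i]  # d(loss)/d(logit)
            w = [wj - lr * grad * xj for wj, xj in zip(w, X[i])]
            b -= lr * grad
    return w, b


# Toy labelled data (invented for this sketch): 1 = positive, 0 = negative.
TRAIN = [
    ("great product, loved it", 1),
    ("works great and fast", 1),
    ("really happy with this", 1),
    ("excellent quality, love it", 1),
    ("terrible, broke after a day", 0),
    ("awful quality, very disappointed", 0),
    ("does not work, waste of money", 0),
    ("really bad experience", 0),
]

X = [embed(text) for text, _ in TRAIN]
y = [label for _, label in TRAIN]
w, b = train_logistic(X, y)

if __name__ == "__main__":
    for text in ["great quality", "terrible experience"]:
        print(text, round(predict_proba(w, b, embed(text)), 3))
```

The point of this setup is that only the small classifier is trained while the embeddings are computed once and cached, which is why it works with little data and no GPU.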