Small LMs (10M-9B) fine-tuned for financial sentiment with chain-of-thought reasoning. SFT + GRPO across 8 architectures. Base + GGUF builds.
Ayan Shaikh
Ayansk11
AI & ML interests
Large Language Models, LoRA/PEFT Fine-tuning, Reinforcement Learning (GRPO, PPO), Multi-Agent Systems, RAG Pipelines, Financial NLP, Cybersecurity AI, Chain-of-Thought Reasoning
Recent Activity
updated a model about 11 hours ago
Ayansk11/FinSenti-Gemma3-270M updated a model 3 days ago
Ayansk11/FinSenti-MobileLLM-R1-950M updated a model 4 days ago
Ayansk11/FinSenti-Tiny-LLM-10MOrganizations
None yet