DeepSeek V4 Replicas Small-scale faithful replicas of the DeepSeek-V4 architecture for ablation and weight-transfer research. kshitijthakkar/deepseek-v4-mini-300M-init Text Generation • 0.3B • Updated Apr 29 • 31 kshitijthakkar/deepseek-v4-mini-1B-init Text Generation • 1B • Updated Apr 29 • 15 kshitijthakkar/deepseek-v4-mini-3B-init Text Generation • 3B • Updated Apr 29 • 7 kshitijthakkar/deepseek-v4-mini-6B-init Text Generation • 8B • Updated Apr 30 • 40 • 2
mcp-server-bench This is a collection of Benchmarking results between Gradio and FastMCP kshitijthakkar/mcp-server-bench Viewer • Updated Feb 27 • 360 • 173 kshitijthakkar/mcp-server-bench-gradio-optimized Viewer • Updated Mar 2 • 48 • 67 kshitijthakkar/mcp-server-bench-gradio Viewer • Updated Mar 4 • 12 • 82 kshitijthakkar/mcp-server-bench-gradio-optimized-full-bench Viewer • Updated Mar 2 • 337 • 94
DeepSeek V4 Replicas Small-scale faithful replicas of the DeepSeek-V4 architecture for ablation and weight-transfer research. kshitijthakkar/deepseek-v4-mini-300M-init Text Generation • 0.3B • Updated Apr 29 • 31 kshitijthakkar/deepseek-v4-mini-1B-init Text Generation • 1B • Updated Apr 29 • 15 kshitijthakkar/deepseek-v4-mini-3B-init Text Generation • 3B • Updated Apr 29 • 7 kshitijthakkar/deepseek-v4-mini-6B-init Text Generation • 8B • Updated Apr 30 • 40 • 2
mcp-server-bench This is a collection of Benchmarking results between Gradio and FastMCP kshitijthakkar/mcp-server-bench Viewer • Updated Feb 27 • 360 • 173 kshitijthakkar/mcp-server-bench-gradio-optimized Viewer • Updated Mar 2 • 48 • 67 kshitijthakkar/mcp-server-bench-gradio Viewer • Updated Mar 4 • 12 • 82 kshitijthakkar/mcp-server-bench-gradio-optimized-full-bench Viewer • Updated Mar 2 • 337 • 94
Runtime error Agents 1 E-Commerce Product Content Generator 🛒 Generate product photos and marketing copy for e‑commerce
Running Agents 1 AI Content Creation Pipeline 🎨 Generate complete social media posts from a text prompt
kshitijthakkar/deepseek-v4-mini-300M-recovered Text Generation • 0.3B • Updated about 1 hour ago • 11 • 1
kshitijthakkar/deepseek-v4-mini-300M-recovered-h100 Text Generation • 0.3B • Updated about 13 hours ago • 12
kshitijthakkar/deepseek-v4-mini-300M-recovered-wip Text Generation • 0.3B • Updated about 18 hours ago • 16
kshitijthakkar/deepseek-v4-mini-300M-from-flash Text Generation • 0.3B • Updated 28 days ago • 320 • 5