worstchan/EAT-base_epoch30_finetune_AS2M Image Feature Extraction • 90.4M • Updated May 6, 2025 • 16k • 3
worstchan/EAT-large_epoch20_finetune_AS2M Image Feature Extraction • 0.3B • Updated May 6, 2025 • 125 • 3
EAT: Self-Supervised Pre-Training with Efficient Audio Transformer Paper • 2401.03497 • Published Jan 7, 2024 • 1