Falcon
FalconLlamalpaca
AI & ML interests
Security
Recent Activity
upvoted a collection 8 days ago
PGC Psychiatric GWAS Summary Statistics updated a collection 20 days ago
Security liked a model 20 days ago
OBLITERATUS/gemma-4-E4B-it-OBLITERATEDOrganizations
Recursive language models
Multimodal
Chain of thought
-
NousResearch/Hermes-4-70B-FP8
Text Generation • 71B • Updated • 3.03k • 34 -
NousResearch/Hermes-4-405B-FP8
Text Generation • 406B • Updated • 329 • 33 -
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Paper • 2508.21365 • Published • 29 -
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
Paper • 2509.15566 • Published • 14
Agentic
-
agents-course/notebooks
Updated • 597 -
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 304 -
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
Paper • 2511.06221 • Published • 135 -
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Paper • 2511.21678 • Published • 13
Paper
-
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Paper • 2410.14059 • Published • 63 -
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 46 -
Token-Efficient Long Video Understanding for Multimodal LLMs
Paper • 2503.04130 • Published • 97 -
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Paper • 2503.10639 • Published • 53
Security
LLMs
-
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 74 -
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 5.35M • • 13.4k -
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 1.01M • • 4.09k -
krutrim-ai-labs/Krutrim-2-instruct
Updated • 295 • 37
MLX
Diffusion
AGI
Olympic Coder Datasets
Speech & Vision LLMs
-
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.97M • • 5.8k -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 7.95M • • 3.07k -
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text • 8B • Updated • 24.9k • 708 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 484k • 1.6k
Spaces
Datasets
Med
MLX
Recursive language models
Diffusion
Multimodal
AGI
Chain of thought
-
NousResearch/Hermes-4-70B-FP8
Text Generation • 71B • Updated • 3.03k • 34 -
NousResearch/Hermes-4-405B-FP8
Text Generation • 406B • Updated • 329 • 33 -
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Paper • 2508.21365 • Published • 29 -
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent
Paper • 2509.15566 • Published • 14
Olympic Coder Datasets
Agentic
-
agents-course/notebooks
Updated • 597 -
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
Paper • 2504.01990 • Published • 304 -
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
Paper • 2511.06221 • Published • 135 -
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Paper • 2511.21678 • Published • 13
Speech & Vision LLMs
-
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.97M • • 5.8k -
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 7.95M • • 3.07k -
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text • 8B • Updated • 24.9k • 708 -
microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 484k • 1.6k
Paper
-
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Paper • 2410.14059 • Published • 63 -
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 46 -
Token-Efficient Long Video Understanding for Multimodal LLMs
Paper • 2503.04130 • Published • 97 -
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
Paper • 2503.10639 • Published • 53
Spaces
Security
Datasets
LLMs
-
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper • 2310.17680 • Published • 74 -
deepseek-ai/DeepSeek-R1
Text Generation • 685B • Updated • 5.35M • • 13.4k -
deepseek-ai/DeepSeek-V3
Text Generation • 685B • Updated • 1.01M • • 4.09k -
krutrim-ai-labs/Krutrim-2-instruct
Updated • 295 • 37