Full Mamba (SSM) with Agent Attention and Fast Feed Forward Sparse Activations | Los Angeles .

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

January 10, 2024 · Los Angeles

Mamba: Agent Attention, Sparse FFN

Explore a custom Mamba implementation integrating agent attention and sparse feed-forward activations, demonstrating faster language modeling and promising results in under 24 hours.

Overview
Tech stack