Benchmark on Splash Attention
Ivan Zhou
Created on May 24 | Last edited on May 25
Comparing Splash Attention and Flash Attention:
Training throughput improves from 2.8M tokens/sec with Flash Attention to 3.2M tokens/sec with Splash Attention, a gain of roughly 14%.
The training and eval loss curves of the two runs also match closely. Numerically, the Splash Attention run even appears to be slightly better.
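As a quick sanity check on the throughput numbers above, the relative gain can be worked out directly. This is a minimal sketch: the function names are illustrative, and it assumes the 2.8M tokens/sec figure belongs to the Flash Attention run and the 3.2M tokens/sec figure to the Splash Attention run.

```python
def relative_speedup(baseline_tps: float, new_tps: float) -> float:
    """Fractional throughput gain of new_tps over baseline_tps."""
    if baseline_tps <= 0:
        raise ValueError("baseline_tps must be positive")
    return new_tps / baseline_tps - 1.0


def time_saved_fraction(baseline_tps: float, new_tps: float) -> float:
    """Fraction of wall-clock time saved when training on a fixed token budget."""
    if baseline_tps <= 0 or new_tps <= 0:
        raise ValueError("throughputs must be positive")
    return 1.0 - baseline_tps / new_tps


# Assumption: 2.8M tokens/sec is the Flash Attention run,
# 3.2M tokens/sec is the Splash Attention run.
flash_tps = 2.8e6
splash_tps = 3.2e6

print(f"throughput gain: {relative_speedup(flash_tps, splash_tps):.1%}")
print(f"time saved for a fixed token budget: {time_saved_fraction(flash_tps, splash_tps):.1%}")
```

Under that assumption the gain is about 14.3% in tokens/sec, which corresponds to about 12.5% less wall-clock time for the same number of training tokens.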
Section 1
[Line chart: train/loss]
[Line chart: throughput/tokens_per_second]