A faster version of Gryphe/MythoMax-L2-13b, served on multiple H100 GPUs in fp8 precision. Up to 160 tokens per second (tps).
07292734b30a95ddfa8c7023f89348f675db02ee
2024-05-07T23:29:58+00:00
Run models at scale with our fully managed GPU infrastructure, delivering enterprise-grade uptime at the industry's best rates.