TransformerAnalyzer / calc_util.py

Commit History

add client throughput
c93009d

Alan Liu commited on

add generation arithmetic intensity
ed50ee5

Alan Liu commited on

add prefill memory
5f0df3a

Alan Liu commited on

inference speed
3698d0a

Alan Liu commited on