Cauvery-7b / About
dharun2049's picture
Create About
c981ab2
raw
history blame contribute delete
623 Bytes
so the use of llms in daily life is increasing however only english is the language of all the base models , plus also the current state of the art llms
are created using the transformer architecture, which has proven to be the industry standard however due to its self attention mechanism
its been computationally inefficient so we are proposing Cauvery 7b , a 7 billion parameter large language model currently under development
that DOES NOT USE THE TRANSFORMER ARCHITECTURE AND AS AN ALTERNATIVE USES THE retentive network architecture with retention mechanism, we are
in our early stages and looking for investors