Mike Brummett PRO

GoDjMike

AI & ML interests

Edge detection, road anomaly identification, story-generation libraries

Recent Activity

liked a model about 8 hours ago
microsoft/Phi-4-multimodal-instruct
liked a model 1 day ago
mlx-community/jinaai-ReaderLM-v2
liked a model 1 day ago
predibase/e2e_nlg
Organizations

None yet

GoDjMike's activity

reacted to nicolay-r's post with 🚀 5 days ago
📢 If you need to translate a massive dataset of JSON-lines / CSV data with varying sets of source fields, the following update may be relevant. While experimenting with adapting a language-specific Sentiment Analysis model, I got a chance to reforge and release bulk-translate 0.25.2.
⭐️ https://github.com/nicolay-r/bulk-translate/releases/tag/0.25.2

The update has the following major features:
- Schema support: all the columns to be translated can now be declared within the same prompt-style format. Using JSON automatically allows them to be mapped onto output fields.
- Related updates for shell execution mode: a schema parameter is now available alongside the previous prompt-only usage.

The benefit is that your output is invariant. You can extend and stack various translators with separate shell launches.
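
To illustrate the schema idea in general terms, here is a minimal sketch of mapping source fields onto fixed output fields through a JSON schema of prompt-style templates. This is not the bulk-translate API: the schema layout, the names `translate_record` and `translate_jsonl`, and the `fake_translate` stand-in engine are all assumptions made for illustration.

```python
import json

# Hypothetical schema: output field -> prompt-style template over source fields.
SCHEMA_JSON = '{"title_en": "{title}", "body_en": "{body}"}'


def fake_translate(text):
    # Stand-in for a real translation engine (assumption); it uppercases
    # the text so the effect is visible without any network access.
    return text.upper()


def translate_record(record, schema, translate):
    """Build one output row with exactly the fields named in the schema."""
    return {
        out_field: translate(template.format(**record))
        for out_field, template in schema.items()
    }


def translate_jsonl(lines, schema, translate):
    """Translate each non-empty JSON-lines row according to the schema."""
    for line in lines:
        if line.strip():
            yield translate_record(json.loads(line), schema, translate)


schema = json.loads(SCHEMA_JSON)
rows = ['{"title": "hola", "body": "mundo", "id": 1}']
result = list(translate_jsonl(rows, schema, fake_translate))
print(result)
```

Because the output keys come only from the schema, the shape of each result row stays the same even when the source rows carry extra fields such as `id` here.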

The screenshot below shows the google-translate engine applied in manual batching mode.
🚀 Performance: 2.5 it/sec (in the case of single-field translation)

🌟 about bulk-translate: https://github.com/nicolay-r/bulk-translate
🌌 nlp-thirdgate: https://github.com/nicolay-r/nlp-thirdgate?tab=readme-ov-file
reacted to clem's post with 🔥 about 1 month ago
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!