Lamar
lamaraguilar
Β·
AI & ML interests
chatgpt
Recent Activity
reacted
to
nroggendorff's
post
with π€―
about 2 months ago
hey nvidia, can you send me a gpu?
comment or react if you want ~~me~~ to get one too. ππ
reacted
to
nroggendorff's
post
with π
about 2 months ago
hey nvidia, can you send me a gpu?
comment or react if you want ~~me~~ to get one too. ππ
reacted
to
nroggendorff's
post
with π€
about 2 months ago
hey nvidia, can you send me a gpu?
comment or react if you want ~~me~~ to get one too. ππ
Organizations
None yet
lamaraguilar's activity

reacted to
nroggendorff's
post with π€―ππ€ππ§ β€οΈπ€ππππ₯β
about 2 months ago

reacted to
anton-l's
post with π
2 months ago
Post
2520
Introducing ππ
π’π§πππππ‘: the best public math pre-training dataset with 50B+ tokens!
HuggingFaceTB/finemath
Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.
We build the dataset by:
π οΈ carefully extracting math data from Common Crawl;
π iteratively filtering and recalling high quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.
We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observe notable gains compared to the baseline model and other public math datasets.
We hope this helps advance the performance of LLMs on math and reasoning! π
Weβre also releasing all the ablation models as well as the evaluation code.
HuggingFaceTB/finemath-6763fb8f71b6439b653482c2
HuggingFaceTB/finemath
Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.
We build the dataset by:
π οΈ carefully extracting math data from Common Crawl;
π iteratively filtering and recalling high quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.
We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observe notable gains compared to the baseline model and other public math datasets.
We hope this helps advance the performance of LLMs on math and reasoning! π
Weβre also releasing all the ablation models as well as the evaluation code.
HuggingFaceTB/finemath-6763fb8f71b6439b653482c2

reacted to
DualityAI-RebekahBogdanoff's
post with π
2 months ago
Post
1342
π Unlock the Power of Synthetic Data for AI Training! π
DualityAI-RebekahBogdanoff/Synthetic_Data_Object_Detection
Weβve uploaded our Cheerios Detector model- a YOLOv8 model trained using synthetic data to recognize cereal boxes in indoor environments. But we know you can make it even more robust! π‘
See the model in action in our βCheerios detectorβ space, and then take it a step further by using FalconEditor to create custom synthetic data.
https://falcon.duality.ai/secure/documentation?learnWelcome=true&sidebarMode=learn
With FalconEditor, you can generate complex, targeted data that will enhance your model's performance and make it even more accurate, adaptable, and robust! π§ β¨
Donβt just use the data β improve it! Create unique scenarios, train for rare edge cases, and employ tailored conditions that push the boundaries of AI training.
DualityAI-RebekahBogdanoff/Synthetic_Data_Object_Detection
Weβve uploaded our Cheerios Detector model- a YOLOv8 model trained using synthetic data to recognize cereal boxes in indoor environments. But we know you can make it even more robust! π‘
See the model in action in our βCheerios detectorβ space, and then take it a step further by using FalconEditor to create custom synthetic data.
https://falcon.duality.ai/secure/documentation?learnWelcome=true&sidebarMode=learn
With FalconEditor, you can generate complex, targeted data that will enhance your model's performance and make it even more accurate, adaptable, and robust! π§ β¨
Donβt just use the data β improve it! Create unique scenarios, train for rare edge cases, and employ tailored conditions that push the boundaries of AI training.