Mohammed Mohammed Ali

MohammedEltoum

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

JournalistsonHF/ai-toolkit

upvoted a collection 3 days ago

olmOCR

reacted to openfree's post with ❤️ 4 days ago

Datasets Convertor 🚀 https://huggingface.co./spaces/openfree/Datasets-Convertor

Welcome to Datasets Convertor, the cutting-edge solution engineered for seamless and efficient data format conversion. Designed with both data professionals and enthusiasts in mind, our tool simplifies conversion between the CSV, Parquet, JSONL, and XLS file formats, ensuring that your data is always in the right shape for your next analytical or development challenge. 💻✨

Why Choose Datasets Convertor?

In today’s data-driven world, managing and converting large datasets can be a daunting task. Our converter is built on top of robust technologies like Pandas and Gradio, delivering reliable performance with a modern, intuitive interface. Whether you’re a data scientist, analyst, or developer, Datasets Convertor empowers you to effortlessly switch between formats while maintaining data integrity and optimizing storage.

Key Features and Capabilities:

CSV ⇆ Parquet Conversion: Easily transform your CSV files into the highly efficient Parquet format and vice versa. Parquet’s columnar storage not only reduces file size but also accelerates query performance, a critical advantage for big data analytics. 🔄📂

CSV to JSONL Conversion: Convert CSV files to JSONL (newline-delimited JSON) to facilitate efficient, line-by-line data processing. This format is particularly useful for streaming data applications, logging systems, and scenarios where incremental data processing is required. Each CSV row is converted into an individual JSON record, preserving all the metadata and ensuring compatibility with modern data pipelines. 📄➡️📝

Parquet to JSONL Conversion: For those working with Parquet files, our tool offers a streamlined conversion to JSONL.

Parquet to XLS Conversion.
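The CSV to JSONL conversion described in the post can be illustrated with a minimal sketch. This is not the Space's actual code: the post says the tool is built on Pandas, whereas this example uses only the Python standard library so that it runs without extra dependencies. The function names `csv_to_jsonl` and `jsonl_to_records` and the sample data are hypothetical, chosen only for illustration.

```python
import csv
import io
import json


def csv_to_jsonl(csv_text):
    """Convert CSV text with a header row to JSONL: one JSON object per row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # The csv module yields every value as a string; this sketch does not
    # attempt any type inference (a Pandas-based tool typically would).
    lines = [json.dumps(row, ensure_ascii=False) for row in reader]
    return "\n".join(lines) + ("\n" if lines else "")


def jsonl_to_records(jsonl_text):
    """Parse JSONL back into a list of dicts, one line at a time."""
    return [json.loads(line) for line in jsonl_text.splitlines() if line.strip()]


# Hypothetical sample data, for illustration only.
sample = "name,score\nAda,90\nLinus,85\n"
jsonl = csv_to_jsonl(sample)
print(jsonl, end="")
```

Because each line is a complete JSON document, a consumer can process the output one record at a time, which is the line-by-line property the post highlights. With Pandas, a comparable result would presumably come from `DataFrame.to_json(orient="records", lines=True)`, with numeric columns inferred rather than kept as strings.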

MohammedEltoum's activity

upvoted a collection 3 days ago

olmOCR

Collection

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 2 days ago • 46

upvoted a paper 20 days ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 22 days ago • 33

upvoted a paper 22 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 24 days ago • 195

upvoted a paper 24 days ago

AIN: The Arabic INclusive Large Multimodal Model

Paper • 2502.00094 • Published 28 days ago • 16

upvoted a paper 27 days ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published Jan 27 • 18

upvoted a paper 3 months ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 29

upvoted a paper 5 months ago

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4, 2024 • 36

upvoted a collection 5 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 18 days ago • 297

upvoted a paper 5 months ago

CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling

Paper • 2409.19291 • Published Sep 28, 2024 • 19

upvoted a collection 5 months ago

Emu3

Collection

Emu3: Next-Token Prediction is All You Need • 7 items • Updated 15 days ago • 69

upvoted 3 papers 5 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Paper • 2409.08513 • Published Sep 13, 2024 • 14

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

upvoted a paper 6 months ago

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Paper • 2408.02555 • Published Aug 5, 2024 • 29