1 39 2

Ksenia Se

Kseniase

https://www.turingpost.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 minutes ago

TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs

upvoted a paper 14 minutes ago

Diverse Preference Optimization

upvoted a paper 36 minutes ago

WARP: An Efficient Engine for Multi-Vector Retrieval

View all activity

Articles

🌁#85: Curiosity, Open Source, and Timing: The Formula Behind DeepSeek’s Phenomenal Success

6 days ago

• 6

🦸🏻#6: The Role of Profiling in Agentic Workflows

7 days ago

• 3

🦸🏻#5: Building Blocks of Agentic Systems

8 days ago

• 3

Topic 24: What is Cosmos World Foundation Model Platform?

10 days ago

• 6

🌁#84: Could Program Synthesis Unlock AGI?

13 days ago

• 4

Topic 23: What is LLM Inference, it's challenges and solutions for it

16 days ago

• 5

🦸🏻#7: From Agentic AI to Physical AI

22 days ago

• 7

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

24 days ago

• 7

AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI

29 days ago

• 3

🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows

Dec 28, 2024

• 10

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

Dec 25, 2024

• 7

🌁#81: Key AI Concepts to Follow in 2025

Dec 23, 2024

• 24

Organizations

Posts 8

Post

713

8 Free Sources on Reinforcement Learning

With the phenomenon of DeepSeek-R1's top reasoning capabilities, we all saw the true power of RL. At its core, RL is a type of machine learning where a model/agent learns to make decisions by interacting with an environment to maximize a reward. RL learns through trial and error, receiving feedback in the form of rewards or penalties.

Here's a list of free sources that will help you dive into RL and how to use it:

1. "Reinforcement Learning: An Introduction" book by Richard S. Sutton and Andrew G. Barto -> https://web.stanford.edu/class/psych209/Readings/SuttonBartoIPRLBook2ndEd.pdf

2. Hugging Face Deep Reinforcement Learning Course -> https://huggingface.co./learn/deep-rl-course/unit0/introduction
You'll learn how to train agents in unique environments, using best libraries, share your results, compete in challenges, and earn a certificate.

3. OpenAI Spinning Up in Deep RL -> https://spinningup.openai.com/en/latest/index.html
A comprehensive overview of RL with many useful resources

4. "Reinforcement Learning and Optimal Control" books, video lectures and course material by Dimitri P. Bertsekas from ASU -> https://web.mit.edu/dimitrib/www/RLbook.html
Explores approximate Dynamic Programming (DP) and RL with key concepts and methods like rollout, tree search, and neural network training for RL and more.

5. RL Course by David Silver (Google DeepMind) -> https://www.youtube.com/watch?v=2pWv7GOvuf0&list=PLqYmG7hTraZDM-OYHWgPeb
Many recommend these video lectures as a good foundation

6. RL theory seminars -> https://sites.google.com/view/rltheoryseminars/home?authuser=0
Provides virtual seminars from different experts about RL advancements

7. "Reinforcement Learning Specialization" (a 4-course series on Coursera) -> https://www.coursera.org/learn/fundament

8. Concepts: RLHF, RLAIF, RLEF, RLCF -> https://www.turingpost.com/p/rl-f
Our flashcards easily explain what are these four RL approaches with different feedback

Post

2924

7 Open-source Methods to Improve Video Generation and Understanding

AI community is making great strides toward achieving the full potential of multimodality in video generation and understanding. Last week studies showed that working with videos is now one of the main focuses for improving AI models. Another highlight of the week is that open source, once again, proves its value. For those who were impressed by DeepSeek-R1, we’re with you!

Today, we’re combining these two key focuses and bringing you a list of open-source methods for better video generation and understanding:

1. VideoLLaMA 3 model: Excels in various video and image tasks thanks to vision-centric training approach. VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding (2501.13106)

2. FILMAGENT framework assigns roles to multiple AI agents, like a director, screenwriter, actor, and cinematographer, to automate the filmmaking process in 3D virtual environments. FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces (2501.12909)

3. Improving Video Generation with Human Feedback (2501.13918) proposes a new VideoReward Model and approach that uses human feedback to refine video generation models.

4. DiffuEraser video inpainting model, based on stable diffusion, is designed to fill in missing areas with detailed, realistic content and to ensure consistent structures across frames. DiffuEraser: A Diffusion Model for Video Inpainting (2501.10018)

5. MAGI is a hybrid video gen model that combines masked and casual modeling. Its key innovation, Complete Teacher Forcing (CTF), conditions masked frames on fully visible frames. Taming Teacher Forcing for Masked Autoregressive Video Generation (2501.12389)

6. Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise (2501.08331) proposes motion control, allowing users to guide how objects or the camera move in generated videos. Its noise warping algorithm replaces random noise in videos with structured noise based on motion info.

7. Video Depth Anything model estimates depth consistently in super-long videos (several minutes or more) without sacrificing quality or speed. Video Depth Anything: Consistent Depth Estimation for Super-Long Videos (2501.12375)

View all posts

Collections 1

models

None public yet

datasets

None public yet

Ksenia Se

AI & ML interests

Recent Activity

Articles

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

🦸🏻#8: Rewriting the Rules of Knowledge: How Modern Agents Learn to Adapt

🅰️ℹ️ 1️⃣0️⃣1️⃣ The Keys to Prompt Optimization

🌁#85: Curiosity, Open Source, and Timing: The Formula Behind DeepSeek’s Phenomenal Success

🦸🏻#6: The Role of Profiling in Agentic Workflows

🦸🏻#5: Building Blocks of Agentic Systems

Topic 24: What is Cosmos World Foundation Model Platform?

🌁#84: Could Program Synthesis Unlock AGI?

Topic 23: What is LLM Inference, it's challenges and solutions for it

🌁#83: GAN is back

🦸🏻#7: From Agentic AI to Physical AI

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?

🌁#82: AI and ML in Real Life

AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI

🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

🌁#81: Key AI Concepts to Follow in 2025

Organizations

Posts 8

Collections 1

Exploring the Best Practices of Query Expansion with Large Language Models

Robust Prompt Optimization for Large Language Models Against Distribution Shifts

Optimizing Query Generation for Enhanced Document Retrieval in RAG

Multi-Aspect Reviewed-Item Retrieval via LLM Query Decomposition and Aspect Fusion

models

datasets

Ksenia Se

AI & ML interests

Recent Activity

Articles

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

🦸🏻#8: Rewriting the Rules of Knowledge: How Modern Agents Learn to Adapt

🅰️ℹ️ 1️⃣0️⃣1️⃣ The Keys to Prompt Optimization

🌁#85: Curiosity, Open Source, and Timing: The Formula Behind DeepSeek’s Phenomenal Success

🦸🏻#6: The Role of Profiling in Agentic Workflows

🦸🏻#5: Building Blocks of Agentic Systems

**Topic 24: What is Cosmos World Foundation Model Platform?**

🌁#84: Could Program Synthesis Unlock AGI?

Topic 23: What is LLM Inference, it's challenges and solutions for it

🌁#83: GAN is back

🦸🏻#7: From Agentic AI to Physical AI

🅰️ℹ️ 1️⃣0️⃣1️⃣ **What is HtmlRAG, Multimodal RAG and Agentic RAG?**

🌁#82: AI and ML in Real Life

AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI

🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

🌁#81: Key AI Concepts to Follow in 2025

Organizations

Posts 8

Collections 1

models

datasets

Topic 24: What is Cosmos World Foundation Model Platform?

🅰️ℹ️ 1️⃣0️⃣1️⃣ What is HtmlRAG, Multimodal RAG and Agentic RAG?