Conversation

Contributor
@altxtech commented Sep 11, 2025

What this does

Adds video file support to RubyLLM.
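A minimal sketch of the new attachment API: videos are passed via `with: { video: ... }`, mirroring the spec snippet reviewed below. The model name, fixture path, and `StubChat` class here are illustrative stand-ins for a real `RubyLLM.chat` call, which needs a configured Gemini API key; the provider call is stubbed so the snippet runs offline.

```ruby
require 'ostruct'

# Stand-in for a RubyLLM chat; a real one would be created with something
# like RubyLLM.chat(model: 'gemini-2.5-flash').
class StubChat
  def ask(prompt, with: {})
    # A real chat would upload with[:video] to the provider here.
    OpenStruct.new(content: "stubbed description of #{with[:video]}")
  end
end

chat = StubChat.new
response = chat.ask('What do you see in this video?',
                    with: { video: 'spec/fixtures/video.mp4' })
puts response.content
```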

Supersedes #260, originally authored by @arnodirlam. Thank you for the groundwork.

Maintainers: happy to close this if you prefer waiting for the original PR.

What changed vs #260

  • Rebased on current main
  • Resolved conflicts in README/docs and Gemini capabilities
  • Addressed PR comment reviews
  • Included gemini-2.5-flash as a video model for tests

Type of change

  • Bug fix
  • New feature
  • Breaking change
  • Documentation
  • Performance improvement

Scope check

  • I read the Contributing Guide
  • This aligns with RubyLLM's focus on LLM communication
  • This isn't application-specific logic that belongs in user code
  • This benefits most users, not just my specific use case

Quality check

  • I ran overcommit --install and all hooks pass
  • I tested my changes thoroughly
    • For provider changes: Re-recorded VCR cassettes with bundle exec rake vcr:record[provider_name]
    • All tests pass: bundle exec rspec
  • I updated documentation if needed
  • I didn't modify auto-generated files manually (models.json, aliases.json)

API changes

  • Breaking change
  • New public methods/classes
  • Changed method signatures
  • No API changes

Related issues

Closes #259

@arnodirlam arnodirlam mentioned this pull request Sep 12, 2025
@altxtech altxtech marked this pull request as ready for review September 12, 2025 11:08

Review thread on the new video spec:

response = chat.ask('What do you see in this video?', with: { video: video_path })

expect(response.content).to be_present
expect(response.content).not_to include('RubyLLM::Content')
Contributor
Can we add an expectation that it recognizes the actual content of the video, like at least includes the words "woman" and "beach"?

Contributor Author
@altxtech Sep 12, 2025

I don't think so, for two reasons:

  1. It's a bit out of scope. No other test in this spec does this; if we were to make that change, it would be better to do it for all tests for consistency, probably in a separate PR.

  2. My understanding is that these are boundary interface tests. In other words, we are testing whether the lib interacts correctly with the providers (sends valid requests) and obtains responses; we are not testing the capabilities of the models themselves.

But I'll wait for more comments. If more people think it makes sense, I can add the assertions.

Contributor

You're right about it not existing in this spec file. I'm a bit surprised, though, as it does exist in the spec for the text models.

expect(response.content).to include('4')

expect(first.content).to include('Matz')

expect(followup.content).to include('199')

I do like how this makes the specs ensure that the models being used actually accomplish the user's intended purpose. But not a showstopper for me.

Contributor Author

You have a good point, but there is also the common wisdom that a project does not need to test the functionality of its external dependencies.

This depends a little on the testing philosophy of the project, so I'd prefer to wait for maintainer feedback before making this change.

Owner

This is a good question to ponder. There's value in adding the checks for content, as they test whether the LLM actually received the file. The problem is that they also test understanding, which we don't want to test. Since I don't think we can separate the two, I'm leaning towards checking the content too. We should probably add similar content checks to the rest of the spec, but perhaps in another PR.
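The stricter assertion under discussion could look like the sketch below. The response text is stubbed here so the snippet runs standalone (a real spec would use the VCR-recorded chat response), and the "woman"/"beach" keywords come from the reviewer's suggestion above.

```ruby
# Stand-in for the provider's reply; a real spec would use the
# VCR-recorded response from chat.ask.
response_content = 'The video shows a woman walking along a beach at sunset.'

# The content check discussed above: assert the model actually described
# the video, not just that it returned something.
%w[woman beach].each do |keyword|
  unless response_content.downcase.include?(keyword)
    raise "expected response to mention #{keyword.inspect}"
  end
end

puts 'content assertions passed'
```

In RSpec terms this would be `expect(response.content.downcase).to include('woman', 'beach')`, alongside the existing presence check.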

Owner
@crmne commented Sep 13, 2025

What a great PR! Clean, focused, following the spirit of the project.

I would have loved support in other providers as well, if you have the API keys.

codecov bot commented Sep 13, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.37%. Comparing base (078ef25) to head (56ece26).

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #405      +/-   ##
==========================================
+ Coverage   84.29%   84.37%   +0.08%     
==========================================
  Files          36       36              
  Lines        1897     1907      +10     
  Branches      493      495       +2     
==========================================
+ Hits         1599     1609      +10     
  Misses        298      298              


Contributor Author
@altxtech

I'm glad you enjoyed it! I love when my PRs are well received.

I don't have other API keys, unfortunately. I think my contributions will be limited to Google/Gemini, at least for now.

@gobijan commented Sep 14, 2025

> I'm glad you enjoyed it! I love when my PRs are well received.
>
> I don't have other API keys, unfortunately. I think my contributions will be limited to Google/Gemini, at least for now.

Thx @arnodirlam for doing the hard work for video support. Thx @altxtech for polishing the last remaining bits.
Great to get this in now :)

Owner
@crmne commented Sep 14, 2025

I added VCRs for VertexAI and couldn't find any other major provider supporting video input directly, so let's ship this!

@crmne crmne merged commit 4ff2231 into crmne:main Sep 14, 2025
Contributor
@arnodirlam

> I added VCRs for VertexAI and couldn't find any other major provider supporting video input directly, so let's ship this!

Nice! 🎉 Yeah, I also tried with all the providers we have, and Gemini was the only one supporting video input.

Linked issue: [FEATURE] video file support