bug: `generate_from_raw` hanging when ollama under load

In the process of testing #567 I encountered an issue where `test/core/test_component_typing.py::test_generating` was timing out after 15 minutes and failing. Seems like I had overloaded ollama by running `for i in {1..10}; do uv run pytest test/core -v; done` and the test failed on the fourth or fifth run. 

Doing a little bit of digging, it seems like the issue happened here: 

https://github.com/generative-computing/mellea/blob/c4cb59f520f141818676a771bf94dbbdaf58305a/mellea/backends/ollama.py#L466

Because there's no timeout, if Ollama stalls on any of the concurrent requests (like it seems happened when I ran this test over and over and over again) the entire call hangs indefinitely. 

There's a couple of different ways we could fix this, but probably the easiest would be to set a timeout on the underlying `ollama.AsyncClient`:

https://github.com/generative-computing/mellea/blob/c4cb59f520f141818676a771bf94dbbdaf58305a/mellea/backends/ollama.py#L201

Ideally we'd make the timeout configurable too (so users could set it when creating the backend?)



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: `generate_from_raw` hanging when ollama under load #650

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

bug: generate_from_raw hanging when ollama under load #650

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

bug: `generate_from_raw` hanging when ollama under load #650