test: make Tool/Agent serialization assertions version-agnostic for Haystack 2.x/3.x by julian-risch · Pull Request #3533 · deepset-ai/haystack-core-integrations

julian-risch · 2026-07-02T11:12:36Z

Related Issues

Part of deepset-ai/haystack-private#446

Serialization tests in five integrations compared to_dict() output against hard-coded expected dicts that pin the exact Tool/Toolset/Agent serialization format of one haystack-ai version. Haystack 3.0 adds fields (e.g. Tool.to_dict() now includes async_function; Agent gained tool_concurrency_limit, hooks, tool_streaming_callback_passthrough), which broke these tests. This PR makes them version-agnostic so they pass on both 2.x and the current v3 branch:

Proposed Changes:

anthropic, huggingface_api, transformers, watsonx: the to_dict tests no longer pin the haystack-owned tools serialization format at all. They exclude the tools entries from the pinned-dict comparison and instead do a real from_dict(component.to_dict()) round-trip, asserting the recovered tools equal the originals.
aimlapi, cometapi, meta_llama, mistral, openrouter, orcarouter, togetherai: the same for their test_serde_in_pipeline tests — the tools entries are popped from the pinned pipeline dict; tool fidelity is covered by the Pipeline.loads(pipeline.dumps()) round-trip (Pipeline.__eq__ compares serialized forms) plus explicit attribute checks on the loaded generator.
github (test_pipeline_serialization): the test pins the full pipeline dict including Agent and OpenAIChatGenerator init parameters, which belong to haystack-ai and change between versions. It now copies any init parameters the installed haystack-ai adds beyond the expected baseline before comparing, replacing the two existing single-key compat shims (http_client_kwargs/tool_invoker_kwargs pops and the confirmation_strategies/required_variables/user_prompt loop). The test keeps pinning what the integration owns: the GitHubFileEditorTool serialization. It also now asserts Pipeline.from_dict(pipeline.to_dict()) == pipeline.
github: also changed import unittest to import unittest.mock — the test uses unittest.mock.ANY and only worked because haystack-ai 2.x happened to import unittest.mock transitively; with v3 installed it fails with AttributeError: module 'unittest' has no attribute 'mock'.

How did you test it?

Haystack 2.x: fmt-check, test:types, and full unit suites pass for all touched integrations.
Haystack v3 (installed git+https://github.com/deepset-ai/haystack.git@v3): the previously failing serialization tests now pass. Verified on a local merge with test: guard ToolInvoker imports so chat-generator tests run under Haystack 3.0 #3535 (needed for collection) and test: trust test modules under Haystack 3.0's deserialization allowlist #3537 (the serde tests deserialize test-module functions, so Pipeline.loads needs the trusted-modules allowlist) — anthropic, aimlapi, transformers, watsonx, huggingface_api, and github unit suites are all fully green on that combined state.

Notes for the reviewer

Reviewing this uncovered that cometapi's test_serde_in_pipeline never deserialized anything (it compared the original generator to itself), which was hiding a real bug: CometAPIChatGenerator could not be deserialized in a pipeline at all. That fix, together with the missing round-trip, was extracted into fix(cometapi): make CometAPIChatGenerator deserializable in pipelines #3542 (now merged); main has been merged back into this branch, so cometapi's test_serde_in_pipeline here combines both changes.

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.

🤖 Generated with Claude Code

Haystack 3.0 extends the serialized format of Tool (async_function) and Agent (tool_concurrency_limit, hooks, ...), breaking tests that compare to_dict() output against hard-coded dicts. Build the expected tool entries with tool.to_dict()/toolset.to_dict() at runtime and, in the github pipeline test, accept init parameters added by the installed haystack-ai version. Also replaces the per-field hasattr/pop compat shims these tests had accumulated for earlier format changes. Co-Authored-By: Claude Fable 5 <noreply@anthropic.com>

github-actions · 2026-07-02T11:13:38Z