Adding adapter tests for Qwen2 by Rishik00 · Pull Request #1309 · TransformerLensOrg/TransformerLens

Rishik00 · 2026-05-18T11:27:03Z

Description

Adds unit coverage for the Qwen2 architecture adapter.

Changes

Tests Qwen2 config defaults, component mapping, and HF module paths.
Verifies Q/K/V/O weight conversions use the right head counts for GQA.
Adds download-free fake attention coverage for Qwen2 GQA hook shapes.
Checks factory registration for Qwen2ForCausalLM.
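For readers unfamiliar with grouped-query attention (GQA), the head-count checks listed above come down to shape arithmetic: Q and O use the full number of heads, while K and V use the smaller number of key/value heads. The following is a minimal sketch of that arithmetic in plain Python, not the code from this PR. `GQAConfig`, `projection_shapes`, and `per_head_shapes` are hypothetical names introduced for illustration, the weight shapes assume the `(out_features, in_features)` layout of a linear layer, and the activation layout `[batch, pos, head, d_head]` is an assumption about shapes before K/V heads are repeated.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GQAConfig:
    """Hypothetical stand-in for the attention fields of a model config."""

    d_model: int
    n_heads: int
    n_kv_heads: int

    @property
    def d_head(self) -> int:
        # Assumes the head dimension is derived as d_model // n_heads.
        return self.d_model // self.n_heads


def projection_shapes(cfg: GQAConfig) -> dict:
    """Return (out_features, in_features) for the Q, K, V and O weights."""
    if cfg.d_model % cfg.n_heads != 0:
        raise ValueError("d_model must be divisible by n_heads")
    if cfg.n_heads % cfg.n_kv_heads != 0:
        raise ValueError("n_heads must be divisible by n_kv_heads")
    q_out = cfg.n_heads * cfg.d_head      # full set of query heads
    kv_out = cfg.n_kv_heads * cfg.d_head  # fewer key/value heads under GQA
    return {
        "q": (q_out, cfg.d_model),
        "k": (kv_out, cfg.d_model),
        "v": (kv_out, cfg.d_model),
        "o": (cfg.d_model, q_out),
    }


def per_head_shapes(cfg: GQAConfig, batch: int, seq: int) -> dict:
    """Per-head activation shapes, assuming [batch, pos, head, d_head]."""
    return {
        "q": (batch, seq, cfg.n_heads, cfg.d_head),
        "k": (batch, seq, cfg.n_kv_heads, cfg.d_head),
        "v": (batch, seq, cfg.n_kv_heads, cfg.d_head),
    }


# Toy numbers chosen for illustration, not taken from a real checkpoint.
cfg = GQAConfig(d_model=64, n_heads=8, n_kv_heads=2)
weights = projection_shapes(cfg)
acts = per_head_shapes(cfg, batch=1, seq=4)
```

A test that mistakenly used `n_heads` for K or V would expect 64 output features in this toy config instead of 16, which is the kind of mismatch the head-count checks are meant to catch.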

To run the test: uv run pytest tests\unit\model_bridge\supported_architectures\test_qwen2_adapter.py

* Fix type of HookedTransformerConfig.device
  This is typed as `Optional[str]` but sometimes returns `torch.device`. Updated the code to just return the `str` instead of wrapping with a device. I'm not confident that every function which takes a device will always be passed a string, so I didn't change functions like warn_if_mps. Found while working on TransformerLensOrg#1219
* more cleanup
* 3.0 CI Bugs (TransformerLensOrg#1261)
* Fixing `utils` imports
* skip gated notebooks on PR from forks
* Updating notebooks
* Ensure LLaMA only runs when HF_TOKEN is available
---------
Co-authored-by: jlarson4 <jonahalarson@comcast.net>

jlarson4

Hi @Rishik00! This is an excellent test suite. I left one comment on the code and have one additional ask:

Can we get some testing for Qwen2's setup_component_testing override? That is the biggest feature that currently isn't getting tested. If you'd like a reference, another contributor has a great example of roughly what I'm looking for in #1311.

jlarson4 · 2026-05-18T15:25:28Z

+    return Qwen2ArchitectureAdapter(cfg)
+
+
+class FakeQwen2Attention(nn.Module):


This is excellent, and it would be great to write some tests that use it, but at present it does not appear to be wired into anything.

Rishik00 · 2026-05-18

My apologies. I will wire it up and update the PR!

Rishik00 · 2026-05-18T18:44:16Z

Of course! I will get that added.

Rishik00 · 2026-05-20T21:32:10Z

Apologies for the dummy files commit. Those files appeared accidentally from elsewhere; I will not let that happen again.

jlarson4 · 2026-05-20T21:35:17Z

@Rishik00 no worries, as long as they're cleaned up they won't be in the final merge! I will review once CI completes

Rishik00 · 2026-05-20T21:36:16Z

Sure, thank you very much!

jlarson4 · 2026-05-21T04:27:12Z

Hi @Rishik00! This is great work. I am going to squash merge this now, which will prevent those furunpod files from being introduced to main in the future.

Make sure to invalidate any keys that may have been included in your initial commits. Because they were committed to a public repository, they are compromised.

brendanlong and others added 6 commits April 20, 2026 14:50

Merge pull request TransformerLensOrg#1277 from TransformerLensOrg/dev

6f56518

TransformerLens 3.1.0

Merge pull request TransformerLensOrg#1294 from TransformerLensOrg/dev

31d4f6a

Release v3.2.0

Merge pull request TransformerLensOrg#1295 from TransformerLensOrg/dev

5f7b02e

Release v3.2.1

qwen2 adapter tests

4570fe6

qwen 2 adapter tests

2f8a436

jlarson4 reviewed May 18, 2026

jlarson4 changed the base branch from main to dev May 18, 2026 15:33

along-l mentioned this pull request May 19, 2026

Add GPT-J architecture adapter tests #1314

Merged

7 tasks

Rishik00 added 3 commits May 21, 2026 02:55

linting

6d08e16

wired up unused attention module

d890020

removed dummy files

6f8f16c

jlarson4 merged commit 4efd766 into TransformerLensOrg:dev May 21, 2026
24 checks passed

jlarson4 merged 9 commits into TransformerLensOrg:dev from Rishik00:qwen-adapter-test

3 participants
