feat: allow output_hidden_states and output_attensions to record outputs of specific layers #43213

cloudhan · 2026-01-10T15:15:15Z

What does this PR do?

This PR enable model forward to record optional outputs at specified layers. This will be particularly useful for large model with long context when explorering the aesthetics of the attention maps to design sparse attention. Without it, with moderate size model (say 7B), it can easily OOM with only 1k level context.

outputs = model.forward(input_ids, output_hidden_states=10, output_attentions=[10])

now it only keeps outputs.attentions[10], outputs of other layers are set to None to save memory.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

…uts of specific layers

github-actions · 2026-01-10T15:29:33Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43213&sha=9f9956

feat: allow output_hidden_states and output_attensions to record outp…

9f9956c

…uts of specific layers

cloudhan force-pushed the record-specified-layers-only branch from 678fc00 to 9f9956c Compare January 10, 2026 15:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: allow output_hidden_states and output_attensions to record outputs of specific layers #43213

feat: allow output_hidden_states and output_attensions to record outputs of specific layers #43213

cloudhan commented Jan 10, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: allow output_hidden_states and output_attensions to record outputs of specific layers #43213

Are you sure you want to change the base?

feat: allow output_hidden_states and output_attensions to record outputs of specific layers #43213

Conversation

cloudhan commented Jan 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

github-actions bot commented Jan 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

cloudhan commented Jan 10, 2026 •

edited

Loading