Skip to content

GPT-OSS has the wrong value of yarn_original_max_position_embeddings #14686

@kzhou92

Description

@kzhou92

The GPT120B config here sets

yarn_original_max_position_embeddings: int = 131072

However, according to huggingface config, it should be

"original_max_position_embeddings": 4096,

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions