OpenAI's GPT-OSS-120B is an open-weight large language model with approximately 117 billion parameters, of which about 5.1 billion are active per token. It is designed for strong reasoning and agentic use, including code execution and structured outputs. Unlike larger models that need multiple GPUs, GPT-OSS-120B can run on a single 80 GB NVIDIA H100 GPU, which makes local deployment more accessible to organizations and advanced users who want privacy, low latency, and control.
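To make the single-GPU claim concrete, here is a rough back-of-the-envelope sketch of weight memory for a 117-billion-parameter model. The parameter counts come from the text above; the 80 GB memory figure and the bits-per-parameter values are assumptions chosen for illustration, and the estimate ignores the KV cache, activations, and runtime overhead, so real usage will be higher.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


TOTAL_PARAMS = 117e9    # total parameters, from the text
ACTIVE_PARAMS = 5.1e9   # active parameters per token, from the text
H100_MEMORY_GB = 80     # assumption: the 80 GB H100 variant

# Assumption: illustrative storage formats, not a statement of the
# exact layout of the released checkpoint.
BF16_BITS = 16          # 16-bit weights
FOUR_BIT_BITS = 4.25    # 4-bit values plus one 8-bit scale per 32 values

bf16_gb = weight_memory_gb(TOTAL_PARAMS, BF16_BITS)
four_bit_gb = weight_memory_gb(TOTAL_PARAMS, FOUR_BIT_BITS)
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS

print(f"16-bit weights: {bf16_gb:.1f} GB (fits: {bf16_gb <= H100_MEMORY_GB})")
print(f"~4-bit weights: {four_bit_gb:.1f} GB (fits: {four_bit_gb <= H100_MEMORY_GB})")
print(f"Active share per token: {active_fraction:.1%}")
```

Under these assumptions, 16-bit weights alone would need roughly 234 GB, well beyond one 80 GB card, while a roughly 4-bit format brings the weights to about 62 GB. The small active share (around 4% of parameters per token) reduces compute per token, but every parameter still has to be resident in memory, so it is the storage format, not the active count, that determines whether the model fits.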