ByteDance is the China-based technology company best known globally for TikTok. Its AI research runs through the ByteDance Seed team, formed in 2023, which builds foundation models for both ByteDance's own products and the open-source community. The team's open releases ship on Hugging Face and GitHub, and its most prominent open model to date is Seed-OSS-36B, a 36-billion-parameter model with a notably large 512K-token context window, released in August 2025 under Apache 2.0.
For business readers, the relevant distinction is between ByteDance's open and closed work. The open Seed-OSS line is freely downloadable and self-hostable under a permissive license; ByteDance also runs closed, hosted products (such as the Doubao assistant, used heavily in China), which aren't cataloged here yet. This initial entry focuses on Seed-OSS as a foundation and fine-tuning base.