Hugging Face and IBM shipped Granite Embedding Multilingual R2[1] on May 14, an Apache 2.0–licensed embedding model with 32,768-token context and sub-100M parameters. The model claims best-in-class retrieval quality among small-footprint embeddings, targeting edge deployment and on-premises workloads where API-only offerings from OpenAI, Cohere, and Anthropic can't run. The release includes three variants (30M, 60M, and 100M parameters) and native support for 110 languages, a span that matters for non-English enterprise search and RAG stacks in regulated jurisdictions that prohibit cross-border data movement. This is the third major release from Hugging Face in ten days—following the Ettin Reranker family on May 19 and the Nemotron-Labs diffusion language model demo with NVIDIA on May 23—and the pattern is deliberate. The company is moving up the stack from model hosting into inference primitives. Embedding and reranking are infrastructure, not creativity; they're the pipes beneath every RAG application, every semantic search engine, every recommendation system. By releasing state-of-the-art open models in these categories, Hugging Face is ensuring that the default inference path runs through its ecosystem—Hub for weights, Transformers for code, Inference Endpoints for deployment—rather than through closed APIs. The 32K context window is the real tell. Most embedding models stop at 512 or 8,192 tokens; extending to 32K lets developers embed entire documents, long-form contracts, research papers, and multi-turn conversations without chunking. That's a step-function reduction in retrieval complexity for legal, biomedical, and financial applications, and it shifts the trade-off away from "send everything to OpenAI's ada-002 API" toward "run a 60M-parameter model on-prem and keep sensitive data internal." The Apache 2.0 license removes the last friction: no attribution requirements, no revenue caps, no model-output restrictions. This is designed to be forked, fine-tuned, and deployed inside every enterprise data stack without negotiating terms.