Qwen 3.5-397B Model Theft: Rio's LLM Exposed as Rebranded Clone

Overview

On 14 June 2026, researchers from Nex-AGI published a GitHub issue demonstrating that Rio-3.5-Open-397B — a large language model presented by IplanRIO (the technology arm of the Rio de Janeiro city government) as an original, domestically developed model — is in fact an undisclosed element-wise weight merge of two existing models: Nex-N2_pro and the official Qwen3.5-397B-A17B base, blended at approximately a 0.6/0.4 ratio. No evidence of independent training by IplanRIO was found.

The case is notable not only as an IP violation but as a concrete example of how model weight merging can be weaponised to fabricate provenance — a growing concern in public-sector AI procurement and open-source model governance.

Technical Analysis

Nex-AGI presented two independent lines of evidence:

1. Identity Probing With Rio’s hard-coded "You are Rio" system prompt stripped, the deployed model self-identified as “Nex, from Nex-AGI” in 79% of test queries — and as “Rio” in 0% of cases. The model also reproduced Nex-AGI’s proprietary organisational backstory verbatim. This technique — sometimes called system prompt extraction via identity probing — exploits the tendency of instruction-tuned models to revert to base persona when overriding system prompts are absent.

2. Tensor-Level Statistical Analysis Every weight tensor across all 60 transformer layers was found to be, to thousands of standard deviations of confidence, the linear combination:

Rio_weight = 0.6 × Nex_weight + 0.4 × Qwen_weight

This is consistent with standard model merging tools (e.g., mergekit) and is statistically irreconcilable with independent training or even standard fine-tuning from either base model. The researchers note that other known fine-tunes of either base model cannot be explained as such interpolations.

Framework Mapping

AML.T0010 – ML Supply Chain Compromise: The model was introduced into a government service under false provenance, corrupting the integrity of the AI supply chain for public-sector deployments.
AML.T0044 – Full ML Model Access: The attacker required full access to Nex model weights to perform the merge, suggesting either a licence violation or an access control failure on the original model’s distribution.
AML.T0056 – LLM Meta Prompt Extraction: The identity probing technique used as evidence is itself a real-world demonstration of meta prompt and persona extraction.
LLM05 – Supply Chain Vulnerabilities: The downstream government deployment inherited undisclosed third-party model components.
LLM10 – Model Theft: The core violation is the uncredited, undisclosed incorporation of a proprietary model’s weights.

Impact Assessment

Nex-AGI: Direct IP and reputational harm; their model is deployed in a government context without attribution or licensing agreement.
Public sector users: Citizens and agencies relying on Rio-3.5 were exposed to a model with undisclosed provenance, unknown safety alignment, and potentially misrepresented capabilities.
AI governance broadly: The incident undermines trust in government AI capability claims and highlights the ease with which weight merging can obscure model origins.

Mitigation & Recommendations

Cryptographic model fingerprinting: Embed unforgeable watermarks or fingerprints at training time to survive downstream merges and detect reuse.
Mandatory model cards with reproducible training provenance: Procurement processes should require verifiable training logs, compute receipts, and dataset attestations.
Identity and persona stress-testing: Before deploying any LLM, probe identity under varied system prompt conditions to surface undisclosed base model personas.
Licence enforcement for open-weight models: Model distributors should enforce licence terms (including attribution requirements) through both legal and technical mechanisms.

References

GitHub Issue #4 – nex-agi/Nex-N2