REVIEWS / AI MODELS / QWEN3.7 PLUS UPDATED JUN 19, 2026 · 63 SOURCES

THE PRODUCT

Qwen3.7 Plus

Qwen3.7 Plus

Strong multimodal and tool-calling performance, but users report session-to-session inconsistency. Competitive with Kimi K2.6; no clear edge over DeepSeek on…

AI MODELS MEDIUM CONFIDENCE

THE VERDICT

5.8

REALITY SCORE · OUT OF 10 · CONFIDENCE MEDIUM

COMPOSED FROM

USERS 5.8 · 60 voices · 100%
CRITICS no published scores yet

SENTIMENT · 63 REVIEWS

+ 28% positive · 47% neutral − 25% negative

OUR VERDICT

WE DON'T RECOMMEND THIS
Score 5.8/10 — no affiliate link by editorial policy
See alternatives →

// Honest verdicts are the whole point. We only monetise products we'd actually recommend.

10 REDDIT 31 YOUTUBE 12 HN 7 PRODUCTHUNT
USER n=63
VIDEO n=3
BRAND AVAILABLE
INTERNET n=0

AT A GLANCE · QUOTABLE

  • Rating: 5.8 / 10 (medium confidence)
  • User voices: 63 across 4 platforms
  • Sentiment: 28% positive · 25% negative
  • Updated: Jun 19, 2026

GYIBB rates the Qwen3.7 Plus 5.8/10 based on 63 user voices from 4 platforms. Confidence: medium. Source: https://gyibb.com/ai-models/qwen3-7-plus

⚠ LIMITED DATA Based on 32 comments and 31 videos

BUY IF

Strong multimodal capabilities — handles screenshots, tool calling, and CAD output in agent loops

  • + Hybrid Thinking Mode lets users toggle between fast responses and deep reasoning
  • + Competitive with Kimi K2.6 on backend feature-planning tasks
  • + Wide model family (0.6B to 235B) with open-weight non-Plus variants for local use

SKIP IF

Erratic session-to-session behavior: same prompt + same file yields wildly different results

  • Plus and Max variants are API-only — no HuggingFace release for self-hosted deployment
  • Pricing opacity at launch; Max variant reportedly 7x cost of Plus with unclear value differential
  • DeepSeek reported as more consistent across environments, eroding Qwen's competitive position

Where the layers disagree

7 CONTRADICTIONS DETECTED

VIDEO titles scream 'SO POWERFUL' and 'WHY IS NO ONE TALKING ABOUT THIS' but USER comments report erratic session-to-session behavior where identical prompts produce disastrously different outputs.

VIDEO VS USER

VIDEO (AICodeKing) tested Qwen 3.7 Max on an elevator simulation and a commenter caught it running incorrectly (all elevators to one floor), while USER comments praise tool-calling — revealing a gap between hype-video framing and actual task reliability.

VIDEO VS USER

USER notes 'no pricing or technical information has been released yet' for 3.7 Plus, but VIDEO comments reveal Qwen3.7-Max costs ~7x more than Qwen3.6-Plus — pricing exists but isn't transparently communicated by the brand.

BRAND VS VIDEO

USER developers want HuggingFace releases for local deployment, but only non-Plus models are open-weighted — Plus and Max variants are API-only, creating an access wall for the most capable models.

USER VS BRAND

USER direct comparison finds Qwen 3.7 Plus and Kimi K2.6 produce 'very close' results — contradicting the VIDEO narrative that Qwen 3.7 Plus is uniquely powerful.

VIDEO VS USER

USER reports DeepSeek is 'much more consistent across different chats, harnesses, environments' than Qwen 3.6/3.7, undermining VIDEO positioning of Qwen as a category leader.

VIDEO VS USER

VIDEO creators (BoxminingAI) saturate descriptions with affiliate links to competing services (Minimax, GLM, Kimi) — raising questions about whether the review is substantive or promotional cross-selling.

VIDEO VS USER

WHERE THEY AGREE +

+ Strong multimodal capabilities — handles screenshots, tool calling, and CAD output in agent loops
+ Hybrid Thinking Mode lets users toggle between fast responses and deep reasoning
+ Competitive with Kimi K2.6 on backend feature-planning tasks
+ Wide model family (0.6B to 235B) with open-weight non-Plus variants for local use

WHERE THEY DON'T

Erratic session-to-session behavior: same prompt + same file yields wildly different results
Plus and Max variants are API-only — no HuggingFace release for self-hosted deployment
Pricing opacity at launch; Max variant reportedly 7x cost of Plus with unclear value differential
DeepSeek reported as more consistent across environments, eroding Qwen's competitive position
Elevator simulation test failure suggests reasoning gaps in multi-agent coordination scenarios

Where the 63 sources came from

VIEW EVERY CITATION →
REDDIT
10
YOUTUBE
31
HN
12
PRODUCTHUNT
7

The four realities

Most review sites collapse everything into one number. We keep the layers separate so you can see where reality bends.

01
USER
n=63 · 4 platforms

What actual buyers say

Users who have tested Qwen3.6/3.7 Plus in agent harnesses report solid multimodal capabilities and tool calling — one user ran it in a carpentry simulator generating CAD files, plans, and build videos, noting it 'performs pretty well, but not at Opus 4.7 levels.' The Hybrid Thinking Mode (toggle between fast and deep reasoning) drew positive attention from ProductHunt users, with the 4B dense model reportedly approaching larger previous-gen performance. However, significant concerns emerged: (1) No pricing or technical info was available at launch for 3.7 Plus. (2) Alibaba does not release Plus or Max model weights to HuggingFace — only non-Plus variants — blocking local deployment for developers who want self-hosted inference. (3) Multiple users expressed desire for an 8-14B parameter model with latest improvements, which hasn't materialized. (4) A direct comparison between Qwen 3.7 Plus and Kimi K2.6 on a .NET backend feature-planning task yielded 'very close' results with 'no clear winner,' though the user preferred Kimi's conciseness. (5) Critically, one developer reported that Qwen 3.6 and 3.7 'really behave erratically every once in a while' — identical prompts on identical files producing disastrously different outputs across fresh sessions, leading them to conclude 'I'm not sure Qwen will ever be able to keep up with Deepseek again... Deepseek is just much more consistent across different chats, harnesses, environments.' API access friction was also noted: Alibaba Cloud's interface was described as discouraging, with users asking whether Qwen has its own API channels separate from Alibaba Cloud.
02
VIDEO
n=31 · YouTube

What reviewers showed on camera

Three YouTube videos covered Qwen3.7, but the substance is thin. AICodeKing (129K subs, 20K views) titled his video 'WHY IS NO ONE TALKING ABOUT THIS!?' — classic hype framing — yet a comment on his own video flagged that the elevator simulation test was wrong: Qwen 3.7 Max moved all three elevators to the same floor with one person, while Gemini 3.5 Flash correctly ran three separate elevators. Another commenter noted 'Qwen3.7-max is about 7 times more expensive than Qwen3.6-plus,' injecting cost reality into the hype. BoxminingAI's video (10.5K subs) was titled 'Qwen 3.7 Plus is SO POWERFUL!' but the description was almost entirely affiliate links (Hostinger, Zeabur, Minimax, GLM) with minimal test substance visible in the excerpt. AI Coding Daily (11.5K subs, 2.8K views) tested it on five projects and a commenter reported parallel-testing Qwen 3.7 Plus and Kimi K2.6, finding results 'very close, no clear winner.' Another commenter criticized the Python benchmark as unrepresentative since 'most of the models are trained in this language.' Across all three videos, no rigorous benchmark methodology was evident, and thinking-mode fairness across models was questioned but not addressed.

Qwen 3.7 Max (+Free API): WHY IS NO ONE TALKING ABOUT THIS!?

AICodeKing · 20,334 views

"[comment] "Free API key" [comment] everything is CRAZY. IT'S CRAZY! CRAZY! [comment] He was never the Guy we thought he would be [comment] Did you look at the contact lens case? [comment] I'm tired boss [comment] Can you do a Pi video give…"

Qwen 3.7 Plus is SO POWERFUL! (Real Tests and Review)

BoxminingAI (Superbash) · 4,342 views

"[comment] ●▬▬▬▬▬▬▬VPS Recommendations▬▬▬▬▬▬▬● 👉🏼 Code BOXMINING for 10% off VPS (annual plan): https://hostinger.com/BOXMINING 👉🏼 Zeabur Server: https://zeabur.com/?ref=boxmining (Save $5 use code: boxmining) ●▬▬▬▬▬▬▬AI Models Recommendati…"

I Tested NEW Qwen3.7-Plus on FIVE Projects

AI Coding Daily · 2,806 views

"[comment] thanks for keeping us updated [comment] Just finished quite a long session of a feature planning for a dotnet backend app with Qwen 3.7 Plus and Kimi K2.6 in parallel. The results are very close, no clear winner. Though I Iike Kim…"

03
INTERNET
n=0 · review sites

What the press said

No aggregate ratings were found for this product during the last harvest.
04
BRAND
official source

What the brand says

no brand page found

The official brand page was not successfully scraped during the last harvest.
Visit Official Site →

SIMILAR IN THIS CATEGORY

See all →
DeepSeek Chat

DeepSeek Chat

9.0

✓ Extremely cost-effective API

GPT-5 mini

GPT-5 mini

8.5

✓ 30-50% faster response times on light queries compared to predecessors

DeepSeek R1

DeepSeek R1

8.0

✓ Strong reasoning capabilities on math and coding benchmarks

OpenAI ChatGPT Plus

OpenAI ChatGPT Plus

8.0

✓ Massive productivity boost for mid-level software engineers

DATA SOURCES & AUDIT

10
REDDIT
31
YOUTUBE
12
HN
7
PRODUCTHUNT
3
YOUTUBE VIDEOS

63 data points across 4 platforms, synthesized via GYIBB's Truth Engine and fact-checked against source data before publication.

CONFIDENCE: MEDIUM · ANALYSED: JUNE 19, 2026 AT 05:06 PM · PROMPT V1.0 · READ METHODOLOGY →

Was this review helpful?

Embed this review

Writing about Qwen3.7 Plus? Add the GYIBB verdict — free, no account needed.

<a href="https://gyibb.com/ai-models/qwen3-7-plus" target="_blank" rel="noopener">
  <img src="https://gyibb.com/badge/ai-models/qwen3-7-plus.svg" alt="GYIBB rating for Qwen3.7 Plus" width="220" height="56">
</a>
← Back to all reviews

Qwen3.7 Plus

GYIBB SCORE: 5.8/10

See alternatives →