Skip to main content Start main content
Nano Banana 2 (Gemini 3.1 Flash Image)

01

 

Google released Gemini 3.1 Flash Image (Nano Banana 2) in February 2026.

Nano Banana 2 brings the high-speed intelligence of Gemini Flash to visual generation, making rapid edits and iteration possible. It also makes once-exclusive Pro features accessible to a wider audience, including:

  • Advanced world knowledge: The model draws from Gemini’s real-world knowledge base and is powered by real-time information and images from web search to render specific subjects more accurately. This deep understanding also helps you create infographics, turn notes into diagrams and generate data visualizations.
  • Precision text rendering and translation: Nano Banana 2 allows you to generate accurate, legible text for marketing mock-ups or greeting cards. You can even translate and localize text within an image to share your ideas globally.
  • Subject consistency: Maintain the resemblance of up to five characters and the fidelity of up to 14 objects in a single workflow, allowing you to storyboard and build narratives without altering the appearance of your inputs.
  • Precise instruction following: With enhanced instruction following, the model adheres more strictly to your complex requests, capturing the specific nuances of your idea so the image you get is the image you asked for.
  • Production-ready specs: Create attention-grabbing assets with full control over various aspect ratios and resolutions from 512px to 4K, ensuring your visuals stay sharp whether they are for a vertical social post or a widescreen backdrop.
  • Visual fidelity upgrade: Nano Banana 2 delivers vibrant lighting, richer textures and sharper details, maintaining high-quality aesthetics at the speed expected from Flash.

The PolyU Gen AI app released the Gemini 3.5 Flash model on 1 June 2026, replacing Gemini 3 Flash Preview. For more information on Gemini 3.1 Flash Image (Nano Banana 2), please refer to Google’s official documents, Build with Nano Banana 2 and Prompting guide to Nano Banana.

 

 

Qwen 3.7 Plus

02

Alibaba Cloud released Qwen3.7-Plus in June 2026, a multimodal agent model that unifies vision and language into a single, versatile agent foundation. Building on Qwen3.7’s strong text backbone, Qwen3.7-Plus delivers a comprehensive upgrade in vision-language capabilities while retaining its full agentic strength in coding, tool use, and productivity workflows.

What sets Qwen3.7-Plus apart is its ability to operate as a multimodal, interactive hybrid agent. It perceives real-world scenes, reads screens and operates GUIs; writes code from visual references, navigates mobile apps end-to-end; and answers visual questions grounded in web knowledge, seamlessly blending GUI and CLI interactions within a single agent loop.

As a versatile coding agent and productivity assistant, it handles the full spectrum of tasks, from frontend prototyping to complex software engineering and multi-step workflow automation, with full-modality input.

The PolyU Gen AI app released Qwen3.7-Plus (2026-05-26) on 1 July 2026, replacing Qwen3.6-Plus (2026-04-02). For details on Qwen3.7-Plus, please refer to the Qwen3.7-Plus Official Blog.

 

 

Mistral-Medium-3.5-128B

03

Mistral AI released its first flagship merged model, Mistral-Medium-3.5-128B, in April 2026. It is a dense 128B model with a 256k context window, handling instruction-following, reasoning, and coding within a single set of weights. Users can expect better performance in instruction-following, reasoning, and coding tasks compared with Mistral AI’s previously released models.

 

Mistral Medium 3.5 offers the following capabilities:

  • Reasoning Mode: Toggle between fast instant-reply mode and reasoning mode, boosting performance with test-time compute when requested.
  • Vision: Analyzes images and provides insights based on visual content, in addition to text.
  • Multilingual: Supports dozens of languages, including English, French, Spanish, German, Italian, Portuguese, Dutch, Chinese, Japanese, Korean, and Arabic.
  • System Prompt: Strong adherence to and support for system prompts.
  • Agentic: Best-in-class agentic capabilities with native function calling and JSON output.
  • Large Context Window: Supports a 256k context window.

The PolyU Gen AI app released Mistral-Medium-3.5-Large on 1 July 2026, replacing Magistral-Small-2509. For details of Mistral-Medium-3.5-128B, please refer to the Mistral-Medium-3.5-128B Hugging Face Repository.

 

Your browser is not the latest version. If you continue to browse our website, Some pages may not function properly.

You are recommended to upgrade to a newer version or switch to a different browser. A list of the web browsers that we support can be found here