Gemini breaks new ground with a faster model, longer context, AI agents and more

1.5 Flash excels at summarization, chat applications, image and video captioning, data extraction from long documents and tables, and more. This is because it’s been trained by 1.5 Pro through a process called “distillation,” where the most essential knowledge and skills from a larger model are transferred to a smaller, more efficient model.
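To make the idea of distillation concrete, here is a minimal sketch of the classic soft-target formulation, in which a student is penalized for diverging from the teacher's temperature-softened output distribution. This is an illustration only: the announcement does not say which distillation method was used for 1.5 Flash, and the function names, the temperature value and the toy logits below are assumptions.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities, softened by the temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    Hypothetical helper for illustration; the T^2 factor is the usual
    scaling that keeps gradient magnitudes comparable across temperatures.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Toy logits over three classes (made-up numbers, not real model outputs).
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]
far_student = [0.5, 1.0, 4.0]

print(distillation_loss(teacher, close_student))
print(distillation_loss(teacher, far_student))
```

A student whose outputs track the teacher's gets a loss near zero, while one that ranks the classes differently gets a larger loss. In practice this term is typically minimized with gradient descent, often alongside an ordinary loss on labeled data.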

Read more about 1.5 Flash on the Gemini technology page, and learn about 1.5 Flash’s availability and pricing. We’ll share more details in an updated Gemini 1.5 technical report soon.

Significantly improving 1.5 Pro

Over the last few months, we’ve significantly improved 1.5 Pro, our best model for general performance across a wide range of tasks.

Beyond extending its context window to 2 million tokens, we’ve enhanced its code generation, logical reasoning and planning, multi-turn conversation, and audio and image understanding through data and algorithmic advances. We see strong improvements on public and internal benchmarks for each of these tasks.

1.5 Pro can now follow increasingly complex and nuanced instructions, including ones that specify product-level behavior involving role, format and style. We’ve improved control over the model’s responses for specific use cases, like crafting the persona and response style of a chat agent or automating workflows through multiple function calls. And we’ve enabled users to steer model behavior by setting system instructions.

We added audio understanding in the Gemini API and Google AI Studio, so 1.5 Pro can now reason across image and audio for videos uploaded in Google AI Studio. And we’re now integrating 1.5 Pro into Google products, including Gemini Advanced and Workspace apps.

Read more about 1.5 Pro on the Gemini technology page. More details are coming soon in our updated Gemini 1.5 technical report.

Gemini Nano understands multimodal inputs

Gemini Nano is expanding beyond text-only inputs to include images as well. Starting with Pixel, applications using Gemini Nano with Multimodality will be able to understand the world the way people do, not just through text, but also through sight, sound and spoken language.

Read more about Gemini 1.0 Nano on Android.
