Modern diffusion-based image generative models have made significant progress and become promising for enriching training data for the object detection task. However, generation quality and controllability remain limited for complex scenes containing multi-class objects and dense objects with occlusions. This paper presents ODGEN, a novel method to generate high-quality images conditioned on bounding boxes, thereby facilitating data synthesis for object detection. Given a domain-specific object detection dataset, we first fine-tune a pre-trained diffusion model on both cropped foreground objects and entire images to fit target distributions. Then we propose to control the diffusion model using synthesized visual prompts with spatial constraints and object-wise textual descriptions. ODGEN exhibits robustness in handling complex scenes and specific domains. Further, we design a dataset synthesis pipeline to evaluate ODGEN on 7 domain-specific benchmarks to demonstrate its effectiveness. Adding training data generated by ODGEN improves mAP@.50:.95 by up to 25.3% with object detectors such as YOLOv5 and YOLOv7, outperforming prior controllable generative methods. In addition, we design an evaluation protocol based on COCO-2014 to validate ODGEN in general domains and observe an advantage of up to 5.6% in mAP@.50:.95 against existing methods.