Animal Classification Dataset

Image classification

Animal Classification Dataset

Use Case: Animal Classification

Format: Image

Count: 300k

Annotation: Yes

X

Description: Internet collected animal images in variable scenarios like indoor, outdoor, nature, gardon and so on.

Arabic & Thai & Vietnamese & Hindi & English & Chinese Language Dataset

Bounding box+Text

Arabic & Thai & Vietnamese & Hindi & English & Chinese Language Dataset

Use Case: OCR

Format: Image

Count: 150k

Annotation: Yes

X

Description: Arabic & Thai & Vietnamese & Hindi & English & Chinese Language Dataset

Arabic Text Dataset

Bounding box+Text

Arabic Text Dataset

Use Case: OCR

Format: Image

Count: 1k

Annotation: Yes

X

Description: The Arabic Text Dataset contains a collection of text samples written in Arabic. It includes various forms of content, such as news articles, social media posts, literature, and dialogue, spanning different topics and writing styles. This dataset is used for tasks such as natural language processing (NLP), text classification, sentiment analysis, and machine translation in Arabic language applications.

Asian Face Occlusion Dataset

Instance Segmentation, Semantic Segmentation

Asian Face Occlusion Dataset

Use Case: Asian Face Occlusion Dataset

Format: Image

Count: 44k

Annotation: Yes

X

Description: The "Asian Face Occlusion Dataset" is tailored for the visual entertainment industry, comprising a vast collection of internet-collected images, each with a resolution exceeding 2736 x 3648 pixels. This dataset focuses on instance and semantic segmentation of Asian faces, specifically targeting individuals aged between 18 and 50 with a male-to-female ratio of 3:7. The unique aspect of this dataset is the inclusion of various face-covering items, providing a diverse range of occlusion scenarios.

Asian Single ID Photo Matting Dataset

Contour segmentation

Asian Single ID Photo Matting Dataset

Use Case: Asian Single ID Photo Matting Dataset

Format: Image

Count: 10k

Annotation: Yes

X

Description: The "Asian Single ID Photo Matting Dataset" is curated for the visual entertainment and social networking service (SNS) sectors, featuring a collection of internet-collected Asian face ID photos, all with a high resolution of 6720 x 4480 pixels. This dataset focuses on contour segmentation, offering pixel-level segmentation specifically tailored to Asian facial features in ID photos, facilitating precise face recognition and editing applications.

Asian student classroom Emotions Dataset

Bounding Box, Classification

Asian student classroom Emotions Dataset

Use Case: Asian student classroom Emotions Dataset

Format: Image

Count: 1k

Annotation: Yes

X

Description: The "Asian Student Classroom Emotions Dataset" is specifically designed for educational applications, featuring internet-collected images of Asian students in classroom settings, all at a uniform resolution of 1280 x 720 pixels. This dataset employs bounding box annotations and classification techniques to identify and categorize students' emotional and performance states in the classroom, aiming to enhance educational methodologies and student engagement strategies.

Asian style Gestures Dataset

Bounding box, Tags

Asian style Gestures Dataset

Use Case: Asian style Gestures Dataset

Format: Image

Count: 21,000

Annotation: Yes

X

Description: The "Asian Style Gestures Dataset" is curated for the visual entertainment industry, featuring a collection of internet-collected images with resolutions ranging from 530 x 360 to 2973 x 3968 pixels. This dataset specializes in annotations of hands displaying Asian style gestures, such as nods, hearts, rock, OK, putting hands together, clasping hands, etc., utilizing bounding boxes and tags for precise identification.

Bank Cheque Dataset (Document AI)

Synthetic Bank Cheque

Bank Cheque Dataset (Document AI)

Use Case: OCR

Format: .jpg

Count: 2023

Annotation: No

X

Description: The Bank Cheque Dataset (Document AI): Synthetic bank cheques consists of artificially generated cheque images designed to replicate the appearance and content of real cheques. It includes various elements such as payee names, amounts, dates, signatures, and cheque numbers. This dataset is used for training and evaluating Document AI systems in tasks like optical character recognition (OCR), cheque processing, and automated data extraction, providing a controlled environment for model development without the privacy concerns of real cheques.

Recording Condition: - Clicked Images - Scanned - Web scrapper

Bank Statement Dataset (Document AI)

Synthetic Bank Statements

Bank Statement Dataset (Document AI)

Use Case: OCR

Format: .jpg, png

Count: 5366

Annotation: No

X

Description: The Bank Statement Dataset (Document AI): Synthetic bank statements includes artificially generated bank statements designed to simulate real financial documents. It features various transaction records, dates, amounts, and account details, structured to mirror real-world formats and content. This dataset is used for training and evaluating Document AI systems in tasks such as optical character recognition (OCR), data extraction, and document analysis, offering a controlled environment without the privacy issues of actual financial data.

Recording Condition: - Scanned - Bank_Statement - Web scrapper

Barcode Image Dataset

Barcode Image Dataset

Use Case: Barcode Scan Identification

Format: .mov, mp4

Count: 2767

Annotation: No

X

Description: Barcode Tye: Code128, UPC/EAN, DataMatrix, PDF417, Aztec, Multi-code

Recording Device: Honor 9A, Huawei mate 10 pro, iPad, iPhone (6S, 7 Plus, SE, X, 11, 12, 12 mini, 12 Pro Max), Moto (E4, onepower), One plus (6T, 7T, One), Oppo A3s, Real Me, Samsung (A20, A30, A32, M12, M31), Vivo z1pro, Xiaomi Mi10T+

Recording Condition: - Bright_Indoor - Low_Indoor - Low_Outdoor - Normal - Sunny

Blur Area Segmentation Dataset

Semantic Segmentation

Blur Area Segmentation Dataset

Use Case: Blur Area Segmentation Dataset

Format: Image

Count: 20k

Annotation: Yes

X

Description: The "Blur Area Segmentation Dataset" is designed for use in robotics and visual entertainment, composed of internet-collected images with resolutions ranging from 960 x 720 to 1024 x 768 pixels. This dataset focuses on semantic segmentation, specifically targeting blue areas within images. Each blue area is annotated at the pixel level, providing valuable data for applications requiring color-based segmentation or analysis.

Car Key Point Identification Dataset

Bounding Box,Key Points

Car Key Point Identification Dataset

Use Case: Car Key Point Identification Dataset

Format: Image

Count: 25k

Annotation: Yes

X

Description: The "Car Key Point Identification Dataset" is designed for applications in visual entertainment and autonomous driving, featuring a collection of internet-collected images with a resolution of 640 x 512 pixels. This dataset employs bounding boxes to identify target cars and annotates 14 key points on each vehicle, including the four top points, the four lights, the four wheels, and the glass areas on the front and left side, providing detailed data for car modeling and recognition tasks.

Cat & Dog Segmentation Dataset

Contour segmentation

Cat & Dog Segmentation Dataset

Use Case: Cat & Dog Segmentation Dataset

Format: Image

Count: 70k

Annotation: Yes

X

Description: The "Cat & Dog Segmentation Dataset" is crafted for the media & entertainment and tourism industries, featuring a broad collection of internet-collected images with resolutions varying from 367 x 288 to 3456 x 4608 pixels. This dataset focuses on contour segmentation and includes diverse annotations such as humans, cats, dogs, and environmental elements like walls, tables, grass, and water surfaces, among others.

Cat&Dog Body Segmentation Supplementary Dataset

Contour segmentation

Cat&Dog Body Segmentation Supplementary Dataset

Use Case: Cat&Dog Body Segmentation Supplementary Dataset

Format: Image

Count: 7k

Annotation: Yes

X

Description: The "Cat & Dog Body Segmentation Supplementary Dataset" is tailored for the visual entertainment industry, comprising a variety of internet-collected images with resolutions exceeding 440 x 440 pixels. This dataset focuses on contour segmentation, specifically delineating the outlines of cats and dogs of various breeds, providing detailed data for applications requiring precise pet representations.

CCTV Traffic Scene Semantic Segmentation Dataset

Instance Segmentation

CCTV Traffic Scene Semantic Segmentation Dataset

Use Case: Auto Driving

Format: Video

Count: 1.2k

Annotation: Yes

X

Description: The "CCTV Traffic Scene Semantic Segmentation Dataset" offers a unique perspective for autonomous driving development, capturing the intricacies of traffic scenes from a stationary point of view. Utilizing high-resolution CCTV footage from road monitoring cameras, with resolutions exceeding 1600 x 1200 pixels and a frame rate of over 7 fps, this dataset provides detailed instance segmentation of various elements in traffic, including humans, animals, cycling vehicles, automobiles, and road barriers. It also encompasses a range of weather conditions, offering a robust dataset for training AI systems to understand and interpret diverse traffic scenarios from a fixed vantage point.

Characters Contour Segmentation Dataset

Contour segmentation

Characters Contour Segmentation Dataset

Use Case: Characters Contour Segmentation Dataset

Format: Image

Count: 1,400

Annotation: Yes

X

Description: The "Characters Contour Segmentation Dataset" is specifically designed for Optical Character Recognition (OCR) applications, featuring a collection of internet-collected images with resolutions ranging from 461 x 169 to 1080 x 1350 pixels. This dataset is centered around contour segmentation, focusing on the precise delineation of OCR optical characters to facilitate accurate character recognition and text extraction processes.

Characters Relationship Segmentation Dataset

Semantic Segmentation,Relationship Segmentation

Characters Relationship Segmentation Dataset

Use Case: Characters Relationship Segmentation Dataset

Format: Image

Count: 162.1k

Annotation: Yes

X

Description: The "Characters Relationship Segmentation Dataset" is designed for the robotics and visual entertainment industries, featuring a wide range of internet-collected images with resolutions spanning from 1280 × 720 to 4608 × 3456. This unique dataset focuses on the relationships between humans, and between humans and objects, providing valuable insights for interaction dynamics.

Chinese & English & Tibetan & Uyghur Language Dataset

Bounding box+Text

Chinese & English & Tibetan & Uyghur Language Dataset

Use Case: OCR

Format: Image

Count: 38k

Annotation: Yes

X

Description: Chinese & English & Tibetan & Uyghur Language Dataset

Chinese and English Menu Dataset

Bounding box+Text

Chinese and English Menu Dataset

Use Case: OCR

Format: Image

Count: 60k

Annotation: Yes

X

Description: The Chinese and English Menu Dataset contains images or text samples of restaurant menus that feature both Chinese and English languages. It includes various fonts, layouts, and menu structures, presenting bilingual dish names, descriptions, and prices. This dataset is useful for tasks such as optical character recognition (OCR), machine translation, and menu digitization in multilingual settings.

Chinese Bills Dataset

Bounding box+Text

Chinese Bills Dataset

Use Case: OCR

Format: Image

Count: 6k

Annotation: Yes

X

Description: The Chinese Bills Dataset includes images or text samples of various types of bills, such as invoices, receipts, and statements, written in Chinese. It features diverse formats and content, including item descriptions, amounts, and dates. This dataset is used for tasks like optical character recognition (OCR), financial document processing, and automated data extraction.

Chinese Handwritten Composition Dataset

Bounding box+Text

Chinese Handwritten Composition Dataset

Use Case: OCR

Format: Image

Count: 3k

Annotation: Yes

X

Description: The Chinese Handwritten Composition Dataset contains samples of handwritten Chinese text, including compositions, essays, and other long-form text. It features various handwriting styles and levels of complexity, and is used for tasks such as handwriting recognition, text analysis, and machine learning model training.

Chinese WIFI Prompt Dataset

Bounding box+Text

Chinese WIFI Prompt Dataset

Use Case: OCR

Format: Image

Count: 1k

Annotation: Yes

X

Description: The Chinese WIFI Prompt Dataset consists of text samples found in WIFI prompts and login screens written in Chinese. It typically includes various prompts, instructions, and error messages related to connecting to or managing WIFI networks. This dataset is used for tasks like text recognition, natural language processing, and improving user interfaces for network connectivity.

City Sky Contour Segmentation Dataset

Contour segmentation

City Sky Contour Segmentation Dataset

Use Case: City Sky Contour Segmentation Dataset

Format: Image

Count: 17k

Annotation: Yes

X

Description: The "City Sky Contour Segmentation Dataset" is curated for the visual entertainment sector, featuring a collection of internet-collected images with a high resolution of 3000 x 4000 pixels. This dataset is dedicated to contour segmentation, focusing on capturing the sky in urban settings with elements such as buildings and plants, providing a detailed backdrop for various visual content creation.

Clothes Segmentation Dataset

Contour segmentation, Semantic Segmentation

Clothes Segmentation Dataset

Use Case: Clothes Segmentation Dataset

Format: Image

Count: 14.3k

Annotation: Yes

X

Description: The "Clothes Segmentation Dataset" is crafted for the e-commerce, fashion, and visual entertainment sectors, incorporating a wide array of internet-collected images with resolutions ranging from 183 x 275 to 3024 x 4032 pixels. This dataset specializes in contour and semantic segmentation, featuring around 30 target categories including clothing items, accessories, and body parts, facilitating detailed analysis and application in fashion technology.

Clothing Classification Dataset

Bounding box, Classification

Clothing Classification Dataset

Use Case: Fashion

Format: Image

Count: 2M

Annotation: Yes

X

Description: The "Clothing Classification Dataset" is an essential resource for the fashion, e-commerce, and digital marketing industries, aiming to streamline the online shopping experience. This dataset encompasses a wide array of clothing items collected from the internet, covering various scenarios such as e-commerce websites, fashion shows, social media platforms, and offline user-generated content. It's designed to support the development of sophisticated algorithms for clothing classification, trend analysis, and personalized recommendation systems.

Clothing Keypoints Dataset

Bounding box, Keypoints

Clothing Keypoints Dataset

Use Case: Fashion

Format: Image

Count: 1M

Annotation: Yes

X

Description: The "Clothing Keypoints Dataset" aims to enhance the precision of fashion-related AI applications by providing a large-scale collection of images for keypoint detection tasks. This dataset includes internet-collected images that span a wide array of scenarios, including e-commerce platforms, fashion shows, social media, and offline user-generated content. It is meticulously annotated to identify keypoints on clothing items, facilitating the development of algorithms for pose estimation, size fitting, style matching, and interactive shopping experiences. The dataset includes classified labels, bounding boxes, and keypoints for 80 different clothing types, making it a comprehensive resource for improving the accuracy and reliability of fashion AI systems.

Clothing Pattern Classification Dataset

Classification, Bounding box

Clothing Pattern Classification Dataset

Use Case: Fashion

Format: Image

Count: 200k

Annotation: Yes

X

Description: The "Clothing Pattern Classification Dataset" is specifically designed to address the needs of the fashion industry, focusing on the classification of various clothing patterns. This dataset gathers internet-collected images that showcase clothing from different scenarios such as e-commerce platforms, fashion shows, social media, and offline user-generated content. It aims to facilitate the development of AI models that can accurately recognize and classify over 30 common clothing patterns, enhancing online shopping experiences and supporting trend analysis.

Clothing Segmentation and Fabrics Classification Dataset

Segmentation, Classification

Clothing Segmentation and Fabrics Classification Dataset

Use Case: Fashion

Format: Image

Count: 200k

Annotation: Yes

X

Description: The "Clothing Segmentation and Fabrics Classification Dataset" merges the complexity of clothing segmentation with the specificity of fabric classification, offering a dual-purpose dataset for the fashion industry. It includes internet-collected images from a variety of sources such as e-commerce websites, fashion shows, social media, and offline user-generated content. The dataset is structured to support the development of AI models that can perform both detailed segmentation of clothing items and classify them into 11 common fabric categories, encompassing 80 distinct clothing types. This dual approach aims to enhance online shopping experiences by providing detailed insights into the type of clothing and fabric, facilitating better inventory management and personalized shopping recommendations.

Clothing Segmentation Dataset

Semantic Segmentation

Clothing Segmentation Dataset

Use Case: Fashion

Format: Image

Count: 500k

Annotation: Yes

X

Description: The "Clothing Segmentation Dataset" is designed to propel the capabilities of AI in the fashion industry by providing a comprehensive collection of images for semantic segmentation tasks. This dataset encompasses internet-collected images from various scenarios such as e-commerce platforms, fashion shows, social media, and offline user-generated content. It focuses on enabling precise segmentation of clothing items, including main human parts, clothing pieces, and accessories, to support the development of advanced AI models for automated image analysis and product categorization.

Cloudy Day City Road Dash Cam Video Dataset

Bounding box, Tags

Cloudy Day City Road Dash Cam Video Dataset

Use Case: Auto Driving

Format: Video

Count: 1k

Annotation: Yes

X

Description: The "Cloudy Day City Road Dash Cam Video Dataset" is crafted to address the challenges autonomous driving systems face in overcast weather conditions. Captured with driving recorders at a resolution exceeding 1920 x 1080 pixels and a frame rate of over 31 fps, this dataset ensures detailed visibility even under the diffused lighting of cloudy skies. It includes bounding boxes and tags for more than 10 object categories commonly encountered in urban settings, such as humans, cars, electric bicycles, vans, and trucks. This dataset aims to refine AI models' ability to navigate and make informed decisions in less-than-ideal weather conditions, enhancing safety and reliability.

Cloudy Day Crossroad Dash Cam Video Dataset

Bounding box, Tags

Cloudy Day Crossroad Dash Cam Video Dataset

Use Case: Auto Driving

Format: Video

Count: 2.4k

Annotation: Yes

X

Description: The "Cloudy Day Crossroad Dash Cam Video Dataset" specifically captures the intricate dynamics of crossroad navigation under cloudy weather conditions. This dataset is filmed with high-resolution driving recorders, boasting resolutions over 1920 x 1080 pixels and a frame rate of more than 32 fps, to ensure clarity and detail even in subdued lighting. It annotates more than 10 typical urban object categories, including humans, cars, electric bicycles, vans, and trucks, amidst the unique challenges presented at crossroads during cloudy days. The dataset is an essential resource for developing autonomous driving systems capable of understanding and reacting appropriately to complex urban intersections, especially when visibility is affected by overcast skies.

Common Objects Segmentation Dataset

Instance Segmentation, Semantic Segmentation

Common Objects Segmentation Dataset

Use Case: Common Objects Segmentation Dataset

Format: Image

Count: 140.7k

Annotation: Yes

X

Description: The "Common Objects Segmentation Dataset" serves the e-commerce and visual entertainment industries with a broad collection of internet-collected images, featuring resolutions ranging from 800 × 600 to 4160 × 3120. This dataset covers a wide array of everyday scenes and objects, including people, animals, furniture, and more, annotated for both instance and semantic segmentation.

Damaged Board Parts Segmentation Dataset

Semantic Segmentation

Damaged Board Parts Segmentation Dataset

Use Case: Damaged Board Parts Segmentation Dataset

Format: Image

Count: 1,000

Annotation: Yes

X

Description: The "Damaged Board Parts Segmentation Dataset" is a niche collection tailored for the manufacturing sector, especially in wood and board production. It features internet-collected images with high resolutions ranging from 3024 x 4032 to 2048 x 5750 pixels. This dataset focuses on semantic segmentation of various types of board damage, including cracks, insect damage, and decay, aiding in quality control and manufacturing processes.

Damaged Car (Minor) Video Dataset

Damaged Car (Minor) Video Dataset

Use Case: Insurance Claim Process

Format: avi, mkv, mov, mp4, mp5

Count: 48366

Annotation: No

X

Description: 360 degrees walk around videos of cars with damages at a normal, steady pace with top and bottom always visible Damage: a scratch, dent, ding, or crack that is larger than a golf ball in length Outer Panel Damage: bumpers, fenders, quarter panels, doors, hoods, and trunks Location: Asia, US, Canada, and Europe

Recording Device: Mobile Camera

Recording Condition: Mixed Lighting Conditions

Damaged Car Image Dataset

Damaged Car Image Dataset

Use Case: Insurance Claim Process

Format: .jpg

Count: 3958

Annotation: Yes

X

Description: 490+ cars and 3958 car photos with annotated images (along with metadata) of damaged cars. Covers all sides of the car (8 photos for each car) - Insurance Claim Process Use Cases.

Recording Device: Mobile Camera

Recording Condition: Mixed Lighting Conditions

Dashcam Traffic Scenes Semantic Segmentation Dataset

Semantic Segmentation

Dashcam Traffic Scenes Semantic Segmentation Dataset

Use Case: Auto Driving

Format: Image

Count: 210

Annotation: Yes

X

Description: The "Dashcam Traffic Scenes Semantic Segmentation Dataset" is essential for pushing the boundaries of autonomous driving technologies. This dataset contains driving recorder images with a resolution of about 1280 x 720 pixels, segmented semantically to reflect various elements of urban and suburban traffic environments. It comprehensively categorizes 24 different objects and scenarios, including sky, people, motor vehicles, non-motorized vehicles, highways, pedestrian paths, zebra crossings, trees, buildings, and more. This detailed semantic segmentation allows autonomous driving systems to better understand and interpret the complexities of the road, enhancing navigation and safety protocols.

Drivable Area Segmentation Dataset

Semantic Segmentation, Binary Segmentation

Drivable Area Segmentation Dataset

Use Case: Auto Driving

Format: Image

Count: 115.3k

Annotation: Yes

X

Description: The "Drivable Area Segmentation Dataset" is meticulously crafted to enhance the capabilities of AI in navigating autonomous vehicles through diverse driving environments. It features a wide array of high-resolution images, with resolutions ranging from 1600 x 1200 to 2592 x 1944 pixels, capturing various pavement types such as bitumen, concrete, gravel, earth, snow, and ice. This dataset is vital for training AI models to differentiate between drivable and non-drivable areas, a fundamental aspect of autonomous driving. By providing detailed semantic and binary segmentation, it aims to improve the safety and efficiency of autonomous vehicles, ensuring they can adapt to different road conditions and environments encountered in real-world scenarios.

E-commerce Product Dataset

Classification,Bounding box

E-commerce Product Dataset

Use Case: E-commerce Product Dataset

Format: Image

Count: 2M

Annotation: Yes

X

Description: The "E-commerce Product Dataset" is a comprehensive collection tailored for the e-commerce sector, featuring a wide range of products from 16 main categories including shoes, hats, bags, furniture, digital products, jewelry, and more. With over 200k SKUs, this dataset is equipped with bounding boxes and category tags, making it a pivotal resource for product classification and inventory management.

Eastern Asia Single-person Portrait Matting Dataset

Segmentation,Contour Segmentation

Eastern Asia Single-person Portrait Matting Dataset

Use Case: Eastern Asia Single-person Portrait Matting Dataset

Format: Image

Count: 50k

Annotation: Yes

X

Description: Our "Eastern Asia Single-person Portrait Matting Dataset" targets the nuanced requirements of the fashion, internet, and entertainment sectors, featuring single-person portraits from Eastern Asia in a variety of settings including indoor, outdoor, street, and sport. This dataset is specially curated for pixel-level fine segmentation tasks, capturing diverse postures and scenarios.

English Menu Dataset

Bounding box+Text

English Menu Dataset

Use Case: OCR

Format: Image

Count: 20k

Annotation: Yes

X

Description: The English Menu Dataset includes images or text samples of restaurant menus written in English. It features a variety of fonts, layouts, and formatting styles, with content ranging from dish names to descriptions and prices. This dataset is often used for tasks like optical character recognition (OCR), text extraction, and menu digitization in food-related applications.

English Scenes Text Dataset

Bounding box+Text

English Scenes Text Dataset

Use Case: OCR

Format: Image

Count: 33k

Annotation: Yes

X

Description: The English Scenes Text Dataset consists of images containing natural scenes with embedded English text. The text appears in various forms, such as signs, billboards, and posters, often in diverse fonts, sizes, and orientations. This dataset is commonly used for training and testing models in text detection, recognition, and scene understanding tasks.

English&Chinese Handwriting Dataset

Bounding box+Text

English&Chinese Handwriting Dataset

Use Case: OCR

Format: Image

Count: 12k

Annotation: Yes

X

Description: The English & Chinese Handwriting Dataset contains handwritten samples in both English and Chinese, showcasing various writing styles and character complexities. It is typically used for training and evaluating handwriting recognition models, supporting multilingual text analysis, and other related research. The dataset includes a diverse range of characters, digits, words, and sentences in both languages.

English&Chinese Shopsign Dataset

Bounding box+Text

English&Chinese Shopsign Dataset

Use Case: OCR

Format: Image

Count: 30k

Annotation: Yes

X

Description: The English & Chinese Shopsign Dataset includes images of shop signs that feature both English and Chinese text. It captures various signage elements such as store names, advertisements, promotions, and directions, displayed in diverse fonts, styles, and formats. This dataset is used for tasks like text detection and recognition, multilingual scene understanding, and improving computer vision models for interpreting bilingual signage.

English&Chinese Special Angle Text Dataset

Bounding box+Text

English&Chinese Special Angle Text Dataset

Use Case: OCR

Format: Image

Count: 50k

Annotation: Yes

X

Description: The English & Chinese Special Angle Text Dataset contains images of text displayed at various angles and orientations in both English and Chinese. It includes text from sources like signs, advertisements, and documents that are not presented in standard horizontal formats. This dataset is used for training and evaluating text detection and recognition models, particularly those capable of handling text in non-traditional orientations and perspectives.

Escalator Face Bounding Dataset

Bounding Box

Escalator Face Bounding Dataset

Use Case: Escalator Face Bounding Dataset

Format: Image

Count: 30k

Annotation: Yes

X

Description: The "Escalator Face Bounding Dataset" is specifically designed for use in government and security sectors, featuring a collection of outdoor-collected images with resolutions exceeding 960 x 540 pixels. This dataset employs bounding boxes to annotate the head, face, and entire body of individuals captured in escalator settings. The annotations are meticulously drawn to encompass the entire face, including any masks that might be worn, ensuring comprehensive facial recognition capabilities even in partially obscured conditions.

Face Parsing Dataset

Segmentation

Face Parsing Dataset

Use Case: Face Parsing Dataset

Format: Image

Count: 100k

Annotation: Yes

X

Description: The "Human Body Semantic Segmentation Dataset" serves the fashion, internet, and entertainment sectors with a diverse collection of human body images. This dataset, featuring an even distribution across genders and ages from various countries, is ideal for applications requiring detailed analysis of human postures, hairstyles, and different scenarios. With fine labeling of 19 human body areas, it facilitates advanced semantic segmentation tasks.

Facial 17 Parts Segmentation Dataset

Semantic Segmentation

Facial 17 Parts Segmentation Dataset

Use Case: Facial 17 Parts Segmentation Dataset

Format: Image

Count: 2k

Annotation: Yes

X

Description: The "Facial 17 Parts Segmentation Dataset" is specifically compiled for the visual entertainment industry, featuring a range of internet-collected facial images with resolutions exceeding 1024 x 682 pixels. This dataset is dedicated to semantic segmentation, delineating 17 facial categories such as eyebrows, lips, eye pupils, and more. It also includes a selection of portrait images with occlusions, adding complexity and diversity to the dataset for more realistic application scenarios.

Facial Color Segmentation Dataset

Semantic Segmentation

Facial Color Segmentation Dataset

Use Case: Facial Color Segmentation Dataset

Format: Image

Count: 3.9k

Annotation: Yes

X

Description: The "Facial Color Segmentation Dataset" is tailored for the beauty and visual entertainment sectors, consisting of internet-collected images with resolutions from 1028 x 1028 to 6016 x 4016 pixels. This dataset focuses on semantic segmentation of facial skin colors, including black, yellow, white, and brown, facilitating diverse applications in cosmetics, virtual makeovers, and inclusive digital content.

Facial Parts Semantic Segmentation Dataset

Semantic Segmentation,Bounding box

Facial Parts Semantic Segmentation Dataset

Use Case: Facial Parts Semantic Segmentation Dataset

Format: Image

Count: 2,791.7k

Annotation: Yes

X

Description: The "Facial Parts Semantic Segmentation Dataset" supports the beauty and media & entertainment sectors, with a collection of images sourced both online and offline. Resolutions vary from 300 x 300 to 4480 x 6720, covering comprehensive facial area categories such as eyes, eyebrows, nose, mouth, hair, and accessories, each meticulously annotated for semantic segmentation and bounding box tasks.

Facial Recognition Datasets

Facial Recognition Datasets

Use Case: Face Recognition

Format: .jpg

Count: 831

Annotation: No

X

Description: Facial recognition datasets consist solely of images of faces, with no additional annotations. They include diverse examples of facial features, poses, and lighting conditions, and are used to train and evaluate facial recognition systems for tasks like face detection and recognition.

Recording Condition: Lighting Condition: - Bright Light Or Sunlight - Shade Or Overcast - Night Or Dim Light

Flying Wire Segmentation Dataset

Instance Segmentation

Flying Wire Segmentation Dataset

Use Case: Flying Wire Segmentation Dataset

Format: Image

Count: 13k

Annotation: Yes

X

Description: The "Flying Wire Segmentation Dataset" is specifically developed for the visual entertainment industry, comprising internet-collected images with resolutions exceeding 1024 x 638 pixels. This dataset is focused on instance segmentation, with a primary emphasis on annotating ropes or wires that span between buildings, offering valuable data for creating realistic urban environments in digital content.

Food Contour Matting Dataset

Segmentation, Contour Segmentation

Food Contour Matting Dataset

Use Case: Food Contour Matting Dataset

Format: Image

Count: 30k

Annotation: Yes

X

Description: Our "Food Contour Matting Dataset" enriches the culinary and visual content domains, featuring ~200 food types from global cuisines. It's designed for businesses in catering, tourism, and entertainment, offering personalized experiences through detailed segmentation annotations.

Food Segmentation Dataset

Contour segmentation

Food Segmentation Dataset

Use Case: Food Segmentation Dataset

Format: Image

Count: 8.3k

Annotation: Yes

X

Description: The "Food Segmentation Dataset" serves the tourism and visual entertainment sectors, consisting of a curated selection of internet-collected images with resolutions from 256 x 256 to 1024 x 768 pixels. This dataset is dedicated to contour segmentation, focusing on common foods and their accompanying plates or bowls, facilitating detailed analysis and representation in various applications.

Full Body Clothing Classification Dataset

Classification, Bounding box

Full Body Clothing Classification Dataset

Use Case: Fashion

Format: Image

Count: 31k

Annotation: Yes

X

Description: The "Full Body Clothing Classification Dataset" is specifically curated to support the advancement of AI in recognizing and classifying full-body clothing from a wide range of internet-collected images. With a focus on high-resolution images, specifically 768 x 1024 pixels, this dataset aims to enhance the precision in classifying full-body attire into major categories such as tops, pants, and skirts, further delineating into 30 sub-categories including jackets, sportswear, baseball uniforms, sweaters, sweatpants, jeans, and half skirts, among others. This dataset is designed to facilitate the development of sophisticated AI models that can accurately classify complex clothing types in full-body images, thereby improving the efficiency and user experience of online fashion retail.

Ghost Image Dataset

Ghost Image Dataset

Use Case: Ghost Image Recognition

Format: HEIC (images) & .mov (videos)

Count: 15610

Annotation: No

X

Description: Sets of still images taken in either daytime or nighttime settings where natural or artificial lighting create a digital artifact known as a ghost.

Recording Device: iPhone & iPad Camera

Recording Condition: - Day Time - Night Time

Glasses Segmentation Dataset

Semantic Segmentation

Glasses Segmentation Dataset

Use Case: Glasses Segmentation Dataset

Format: Image

Count: 13.9k

Annotation: Yes

X

Description: The "Glasses Segmentation Dataset" is aimed at the apparel and visual entertainment sectors, incorporating a diverse array of internet-collected images with resolutions from 165 x 126 to 1250 x 1458 pixels. This dataset focuses on semantic segmentation of various types of eyewear, including pure transparent glasses, sunglasses, and translucent glasses, providing detailed annotations for each category.

Hair Semantic Segmentation Dataset

Contour Segmentation, Semantic Segmentation

Hair Semantic Segmentation Dataset

Use Case: Hair Semantic Segmentation Dataset

Format: Image

Count: 32.2k

Annotation: Yes

X

Description: The "Hair Semantic Segmentation Dataset" serves the apparel and media & entertainment industries, featuring a curated collection of internet-collected images with resolutions varying from 343 x 358 to 2316 x 3088 pixels. This dataset specializes in high-precision contour and semantic segmentation of hair, offering detailed annotations for a wide range of hairstyles and textures.

Hand Key Point Skeleton Dataset

Key Points

Hand Key Point Skeleton Dataset

Use Case: Hand Key Point Skeleton Dataset

Format: Image

Count: 10k

Annotation: Yes

X

Description: The "Hand Key Point Skeleton Dataset" is designed for applications in visual entertainment and augmented/virtual reality (AR/VR), featuring a collection of indoor-collected images with a high resolution of 3024 x 4032 pixels. This dataset focuses on labeling 21 key points of the hand skeleton, capturing specific single-handed or two-handed poses such as forming a heart shape, placing a hand on the cheek, stretching, and more.

Handwritten Text Dataset

Handwritten Text Dataset

Use Case: Document AI

Format: HEIC (images) & .mov (videos)

Count: 94053

Annotation: No

X

Description: Live Photos with Handwritten text for Japanese, Korean & Russian

Recording Device: iPhone & iPad Camera

Recording Condition: - Aggressive Lighting/Glare - Camera Flash On - Colored Light - Low Light, No Camera Flash - Normal

Head and Neck Semantic Segmentation Dataset

Semantic Segmentation

Head and Neck Semantic Segmentation Dataset

Use Case: Head and Neck Semantic Segmentation Dataset

Format: Image

Count: 14k

Annotation: Yes

X

Description: The "Head and Neck Semantic Segmentation Dataset" is designed for the e-commerce & retail and media & entertainment sectors, featuring a collection of AI-generated cartoon images with resolutions above 1024 x 1024 pixels. This dataset focuses on semantic segmentation, specifically targeting the main character's head, including face, hair, and any accessories, as well as the neck area up to the collarbone, with an allowance for small, unsegmented parts on the edges.

Historical Dataset

Historical Dataset

Use Case: Landmark Identification, Landmarks Tagging

Format: .jpg, mp4

Count: 2087

Annotation: No

X

Description: Collect images (1 Enrollment photo, 20 Historical photos per Identity) and videos (1 Indoor, 1 Outdoor) from unique identities

Human And Accessories Segmentation Dataset

Semantic Segmentation

Human And Accessories Segmentation Dataset

Use Case: Human And Accessories Segmentation Dataset

Format: Image

Count: 74.3k

Annotation: Yes

X

Description: The "Human And Accessories Segmentation Dataset" is a valuable resource for the apparel, e-commerce, and media & entertainment industries, featuring internet-collected images with resolutions ranging from 584 x 429 to 3744 x 5616. This dataset is rich in diversity, encompassing a wide array of accessories like mobile phones, suitcases, skateboards, and animals, all annotated for semantic segmentation.

Human And Multi-object Panoptic Segmentation Dataset

Instance Segmentation, Semantic Segmentation

Human And Multi-object Panoptic Segmentation Dataset

Use Case: Human And Multi-object Panoptic Segmentation

Format: Image

Count: 8k

Annotation: Yes

X

Description: The "Human And Multi-object Panoptic Segmentation Dataset" is curated for applications in visual entertainment, featuring a wide array of internet-collected images with resolutions exceeding 1280 x 700 pixels. This comprehensive dataset integrates both instance and semantic segmentation to label a diverse range of elements found in everyday life, including natural scenery, people, buildings, and animals, offering a panoptic view of various scenes and subjects.

Human Body High Precision Segmentation Dataset

Semantic Segmentation

Human Body High Precision Segmentation Dataset

Use Case: Human Body High Precision Segmentation Dataset

Format: Image

Count: 424.8k

Annotation: Yes

X

Description: The "Human Body High Precision Segmentation Dataset" is a comprehensive collection aimed at the apparel, e-commerce, and visual entertainment sectors, combining manually shot and internet-collected images with resolutions from 316 × 600 to 6601 × 9900. It focuses on high-precision segmentation of the human body, capturing intricate details of limbs, clothing, facial features, skin, and accessories.

Human Body Parts Fine Segmentation Dataset

Instance Segmentation, Semantic Segmentation

Human Body Parts Fine Segmentation Dataset

Use Case: Human Body Parts Fine Segmentation

Format: Video

Count: 1.7k

Annotation: Yes

X

Description: Images are from internet. Resolution ranges from 105 x 251 to 319 x 951.

Human Body Segmentation Dataset

Semantic Segmentation

Human Body Segmentation Dataset

Use Case: Human Body Segmentation Dataset

Format: Image

Count: 85.7k

Annotation: Yes

X

Description: The "Portrait Matting Dataset" caters to the apparel and media & entertainment sectors, featuring a diverse collection of live screenshot images with resolutions varying from 138 × 189 to 6000 × 4000. This dataset is comprehensive, including single individuals, groups, and their accessories, and is annotated for contour, semantic, and instance segmentation tasks.

Human Body Semantic Segmentation Dataset

Segmentation

Human Body Semantic Segmentation Dataset

Use Case: Human Body Semantic Segmentation Dataset

Format: Image

Count: 100k

Annotation: Yes

X

Description: The "Human Body Semantic Segmentation Dataset" serves the fashion, internet, and entertainment sectors with a diverse collection of human body images. This dataset, featuring an even distribution across genders and ages from various countries, is ideal for applications requiring detailed analysis of human postures, hairstyles, and different scenarios. With fine labeling of 19 human body areas, it facilitates advanced semantic segmentation tasks.

Human Contour Segmentation And Keypoints Dataset

Contour segmentation, Key points

Human Contour Segmentation And Keypoints Dataset

Use Case: Human Contour Segmentation And Keypoints Dataset

Format: Image

Count: 14.4k

Annotation: Yes

X

Description: The "Human Contour Segmentation And Keypoints Dataset" is aimed at the apparel and visual entertainment industries, featuring a collection of internet-collected images with resolutions ranging from 103 x 237 to 329 x 669 pixels. This dataset is focused on contour segmentation and key points annotation, covering comprehensive human body keypoints including facial features, limbs, and extremities, facilitating detailed human posture and movement analysis.

Human Portrait Matting Dataset

Instance Segmentation, Semantic Segmentation

Human Portrait Matting Dataset

Use Case: Human Portrait Matting

Format: Video

Count: 4.1k

Annotation: Yes

X

Description: Images are from internet. Resolution ranges from 1280 x 720 to 2048 x 1080.

Human Posture Classification Dataset

Bounding Box, Tags

Human Posture Classification Dataset

Use Case: Human Posture Classification Dataset

Format: Image

Count: 17k

Annotation: Yes

X

Description: The "Human Posture Classification Dataset" is designed for applications in visual entertainment and robotics, consisting of a collection of indoor-collected images with high resolutions exceeding 3024 x 4032 pixels. This dataset emphasizes bounding box annotations and tagging to identify half-body portraits and classify them into 14 distinct types of poses, such as crossed hands, hands around the head, and one hand on the cheek, among others.

Indoor Facial 130 Expressions Dataset

Key points

Indoor Facial 130 Expressions Dataset

Use Case: Indoor Facial 130 Expressions Dataset

Format: Image

Count: 4k

Annotation: Yes

X

Description: The "Indoor Facial 130 Expressions Dataset" is designed for applications in media & entertainment and mobile sectors, featuring a collection of internet-collected indoor facial images with resolutions ranging from 443 x 443 to 1127 x 1080 pixels. This dataset specializes in key points annotation, providing 130 key points for each facial expression, offering a detailed foundation for emotion recognition, facial animation, and interactive applications.

Indoor Facial 182 Keypoints Dataset

Key Points

Indoor Facial 182 Keypoints Dataset

Use Case: Indoor Facial 182 Keypoints Dataset

Format: Image

Count: 28,000

Annotation: Yes

X

Description: The "Indoor Facial 182 Keypoints Dataset" is a specialized resource for the internet, media, entertainment, and mobile industries, focusing on detailed facial analysis. It includes images of 50 individuals in indoor settings, with a balanced gender distribution and ages ranging from 18 to 50. Each face is annotated with 182 key points, facilitating precise facial feature tracking and analysis.

Indoor Facial 75 Expressions Dataset

Key Points

Indoor Facial 75 Expressions Dataset

Use Case: Indoor Facial 75 Expressions Dataset

Format: Image

Count: 20k

Annotation: Yes

X

Description: The "Indoor Facial 75 Expressions Dataset" enriches the internet, media, entertainment, and mobile sectors with an in-depth exploration of human emotions. It features 60 individuals in indoor settings, showcasing a balanced gender representation and varied postures, with 75 distinct facial expressions per person. This dataset is tagged with facial expression categories, making it an invaluable tool for emotion recognition and interactive applications.

Indoor Multi-person Panoptic Segmentation Dataset

Panoptic Segmentation

Indoor Multi-person Panoptic Segmentation Dataset

Use Case: Indoor Multi-person Panoptic Segmentation Dataset

Format: Image

Count: 14k

Annotation: Yes

X

Description: The "Indoor Multi-person Panoptic Segmentation Dataset" is designed for the visual entertainment sector, consisting of a collection of internet-collected indoor images with resolutions exceeding 1543 x 2048 pixels. This dataset emphasizes panoptic segmentation, capturing every identifiable instance within indoor scenes, including people, furniture, tableware, food, and other elements, providing a comprehensive dataset for detailed indoor scene analysis and creation.

Indoor Multiple Person & Object Segmentation Dataset

Segmentation

Indoor Multiple Person & Object Segmentation Dataset

Use Case: Indoor Multiple Person & Object Segmentation Dataset

Format: Image

Count: 7,500

Annotation: Yes

X

Description: The "Indoor Multiple Person & Object Segmentation Dataset" is designed for the internet and media & entertainment sectors, featuring a collection of drama images set in indoor living scenarios. This dataset, with an average of 5 to 6 persons per picture, spans Asian, American, and English contexts. It supports detailed semantic segmentation tasks for human body areas, clothing and accessories, and indoor objects.

Indoor Objects Segmentation Dataset

Instance Segmentation, Semantic Segmentation,Contour Segmentation

Indoor Objects Segmentation Dataset

Use Case: Indoor Objects Segmentation Dataset

Format: Image

Count: 51.6k

Annotation: Yes

X

Description: The "Indoor Objects Segmentation Dataset" serves the advertisement, gaming, and visual entertainment sectors, offering high-resolution images ranging from 1024 × 1024 to 3024 × 4032. This dataset includes over 50 types of common indoor objects and architectural elements, such as furniture and room structures, annotated for instance, semantic, and contour segmentation.

Industrial Metal Smelting Flame Classification

Classification

Industrial Metal Smelting Flame Classification

Use Case: Industrial Metal Smelting Flame Classification

Format: Image

Count: 41k

Annotation: Yes

X

Description: The "Industrial Metal Smelting Flame Classification Dataset" is designed for the industry sector, featuring a collection of internet-collected images of metal smelting flames, all with a resolution of 350 x 350 pixels. This dataset is dedicated to the classification of flame images into 10 categories, including overexposure, black smoke, fire mass, sparks, and various intensities of slag jumping and spatter, providing crucial data for monitoring and optimizing smelting processes.

Japanese & Korean Language Dataset

Bounding box+Text

Japanese & Korean Language Dataset

Use Case: OCR

Format: Image

Count: 40k

Annotation: Yes

X

Description: The Japanese & Korean Language Dataset includes text samples in both Japanese and Korean. It features a range of content such as sentences, phrases, and words, encompassing various contexts and styles. This dataset is used for tasks like natural language processing (NLP), machine translation, and text analysis in multilingual applications.

Kitchen Sanitation Video Dataset

Bounding box, Tags

Kitchen Sanitation Video Dataset

Use Case: Kitchen Sanitation Video Dataset

Format: Video

Count: 7k

Annotation: Yes

X

Description: CCTV cameras Images. Resolution is over 1920 x 1080 and the number of frames per second of the video is over 30.

Landmark Image Dataset

Landmark Image Dataset

Use Case: Landmark Identification, Landmarks Tagging

Format: .jpg

Count: 34118

Annotation: No

X

Description: Images of landmarks within the context of their environment

Recording Device: Mobile Camera

Recording Condition: - Daylight - Night - Overcast/Rain

Lane Line Segmentation Dataset

Binary Segmentation, Semantic Segmentation

Lane Line Segmentation Dataset

Use Case: Auto Driving

Format: Image

Count: 135.3k

Annotation: Yes

X

Description: The "Lane Line Segmentation Dataset" is designed to accelerate advancements in autonomous driving technologies, specifically focusing on lane detection and segmentation. It includes a vast array of images from driving recorders, segmented into 35 distinct categories to cover a comprehensive range of road markings such as various solid and dashed lines in white and yellow. This dataset aims to refine the precision of AI in identifying lane boundaries, crucial for the safe navigation of autonomous vehicles.

Lane Merging and Fork Area Segmentation Dataset

Binary Segmentation

Lane Merging and Fork Area Segmentation Dataset

Use Case: Auto Driving

Format: Image

Count: 4.2k

Annotation: Yes

X

Description: The "Lane Merging and Fork Area Segmentation Dataset" specifically addresses the complexities of lane merging and forking, critical scenarios in autonomous driving. This dataset, consisting of driving recorder images, is annotated for binary segmentation, focusing on areas where lanes merge or branch off. It includes detailed labels for lane merging areas, lane fork areas (marked by triangular inverted lines), and potential obstructions such as vehicles, trees, road signs, and pedestrians. This dataset is a vital tool for training AI models to navigate these challenging road situations, ensuring smoother and safer autonomous driving experiences.

Lips Segmentation Dataset

Semantic Segmentation

Lips Segmentation Dataset

Use Case: Lips Segmentation Dataset

Format: Image

Count: 13.9k

Annotation: Yes

X

Description: The "Glasses Segmentation Dataset" is aimed at the apparel and visual entertainment sectors, incorporating a diverse array of internet-collected images with resolutions from 165 x 126 to 1250 x 1458 pixels. This dataset focuses on semantic segmentation of various types of eyewear, including pure transparent glasses, sunglasses, and translucent glasses, providing detailed annotations for each category.

Long-range Pedestrian Dataset

Bounding Box

Long-range Pedestrian Dataset

Use Case: Long-range Pedestrian Dataset

Format: Image

Count: 10k

Annotation: Yes

X

Description: The "Long-range Pedestrian Dataset" is curated for the visual entertainment sector, featuring a collection of outdoor-collected images with a high resolution of 3840 x 2160 pixels. This dataset is focused on long-distance pedestrian imagery, with each target pedestrian precisely labeled with a bounding box that closely fits the boundary of the pedestrian target, providing detailed data for scene composition and character placement in visual content.

Low lighting Dash Cam Video Dataset

Bounding box, Tags

Low lighting Dash Cam Video Dataset

Use Case: Auto Driving

Format: Video

Count: 800

Annotation: Yes

X

Description: The "Low Lighting Dash Cam Video Dataset" is tailored for autonomous driving systems to navigate through low-light conditions, a crucial capability for safe driving during night-time or in poorly lit environments. Captured with driving recorders at resolutions exceeding 1920 x 1080 pixels and a frame rate of more than 30 fps, this dataset focuses on low lighting scenarios across various settings such as crossroads, avenues, and paths. It encompasses bounding boxes and tags for common urban objects like humans, cars, electric bicycles, vans, and trucks, providing a comprehensive view of the challenges faced by autonomous vehicles in reduced visibility.

Machine Part Defects Segmentation Dataset

Binary Segmentation

Machine Part Defects Segmentation Dataset

Use Case: Machine Part Defects Segmentation Dataset

Format: Image

Count: 120k

Annotation: Yes

X

Description: The "Machine Part Defects Segmentation Dataset" is designed for the manufacturing industry, consisting of internet-collected images, all with a resolution of 1000 x 1000 pixels. This dataset focuses on binary segmentation to identify white defects on machine parts, providing clear annotations that highlight areas of concern for quality control and inspection processes.

Machine Parts Segmentation Dataset

Semantic Segmentation, Polygon, Key Points

Machine Parts Segmentation Dataset

Use Case: Machine Parts Segmentation Dataset

Format: Image

Count: 2.3k

Annotation: Yes

X

Description: The "Machine Parts Segmentation Dataset" is tailored for the manufacturing sector, featuring a collection of internet-collected images with a resolution of 2048 x 1536 pixels. This dataset is specialized in semantic segmentation, polygon, and key points annotations, focusing on contour annotation of machining positions within X-ray images of machine parts, facilitating precise analysis and inspection in manufacturing processes.

Main Objects Segmentation Dataset

Contour segmentation, Semantic Segmentation

Main Objects Segmentation Dataset

Use Case: Main Objects Segmentation Dataset

Format: Image

Count: 177.4k

Annotation: Yes

X

Description: The "Main Objects Segmentation Dataset" is designed for applications in robotics and visual entertainment, comprising a vast collection of internet-collected images with resolutions ranging from 189 x 223 to 5472 x 3648 pixels. This dataset focuses on contour and semantic segmentation of a single labeled subject in each image, providing a clear and isolated view of the primary object for detailed analysis and application.

Model Clothing Segmentation Dataset

Semantic Segmentation

Model Clothing Segmentation Dataset

Use Case: Model Clothing Segmentation Dataset

Format: Image

Count: 2k

Annotation: Yes

X

Description: The "Model Clothing Segmentation Dataset" is curated for the e-commerce & retail sector, featuring a collection of internet-collected images with a resolution of 816 x 1224 pixels. This dataset focuses on semantic segmentation of high-resolution images showcasing models in various outfits, encompassing male, female, and children's wear, to accurately reflect real human silhouettes. The annotations include detailed segmentation of the clothing worn by the models, such as hats, shoes, tops, and bottoms.

Multi-person And Appendages Segmentation Dataset

Instance Segmentation, Semantic Segmentation

Multi-person And Appendages Segmentation Dataset

Use Case: Multi-person And Appendages Segmentation Dataset

Format: Image

Count: 7k

Annotation: Yes

X

Description: The "Multi-person And Appendages Segmentation Dataset" is designed for the visual entertainment sector, featuring a collection of internet-collected images with resolutions exceeding 2736 x 3648 pixels. This dataset employs both instance and semantic segmentation techniques to annotate multiple people and their appendages in various scenes. The appendages include shadows, hand-held objects, riding objects, and more, providing a comprehensive view of human interactions with their environment.

Multi-Pet Matting Dataset

Contour Segmentation,Semantic Segmentation

Multi-Pet Matting Dataset

Use Case: Multi-Pet Matting Dataset

Format: Image

Count: 7k

Annotation: Yes

X

Description: The "Multi-Pet Matting Dataset" is curated for applications in visual entertainment and financial services, featuring a collection of internet-collected images with resolutions exceeding 1920 x 1280 pixels. This dataset focuses on both contour and semantic segmentation of multiple pet instances within each image, specifically limited to cats and dogs. Each pet instance is saved with an individual matting mask, with the mask granularity refined to the hair-strand level, providing detailed data for creating realistic pet representations and interactions in digital content.

Multi-Scenario, Multi-Person Instance Segmentation Dataset

Instance Segmentation,Bounding Box

Multi-Scenario, Multi-Person Instance Segmentation Dataset

Use Case: Multi-Scenario, Multi-Person Instance Segmentation Dataset

Format: Image

Count: 10k

Annotation: Yes

X

Description: The "Multi-Scenario, Multi-Person Instance Segmentation Dataset" is designed for diverse applications in visual entertainment, media & entertainment, and e-commerce & retail sectors. It consists of a collection of internet-collected images with resolutions exceeding 640 x 480 pixels. This dataset is characterized by its variety, featuring different scenarios such as individuals sitting, groups seated together, people holding props, interacting with various attire like hats and bags, and making different gestures. It employs instance segmentation and bounding box annotations to facilitate comprehensive analysis of human subjects within these varied contexts.

Multiple Objects Matting Dataset

Segmentation

Multiple Objects Matting Dataset

Use Case: Multiple Objects Matting Dataset

Format: Image

Count: 318.6k

Annotation: Yes

X

Description: The "Multiple Objects Matting Dataset" is designed for use in robotics and visual entertainment, featuring a vast collection of internet-collected images with resolutions ranging from 1080 x 1362 to 6000 x 4000 pixels. This dataset specializes in segmentation, providing the original image, a transparent effect image, and a mask black-and-white image for the main object, enabling detailed analysis and application in various technological solutions.

Multiple Scenarios And Persons Semantic Segmentation Dataset

Contour Segmentation,Semantic Segmentation

Multiple Scenarios And Persons Semantic Segmentation Dataset

Use Case: Multiple Scenarios And Persons Semantic Segmentation

Format: Image

Count: 54k

Annotation: Yes

X

Description: The "Multiple Scenarios And Persons Semantic Segmentation" dataset is tailored for the visual entertainment industry, comprising internet-collected images with resolutions from 1280 x 720 to 6000 x 4000. It focuses on multi-person scenes across urban, natural, and indoor settings, providing detailed annotations for human figures, accessories, and backgrounds.

Nails Contour Segmentation Dataset

Semantic Segmentation

Nails Contour Segmentation Dataset

Use Case: Nails Contour Segmentation Dataset

Format: Image

Count: 5.9k

Annotation: Yes

X

Description: The "Nails Contour Segmentation Dataset" is crafted for the beauty industry, featuring a collection of offline human fingernail images, all at a uniform resolution of 1920 x 1080 pixels. This dataset specializes in semantic segmentation, with a focus on the detailed contour of fingernails, supporting applications in nail art design and virtual nail try-on technologies.

Object Contour Matting Dataset

Segmentation

Object Contour Matting Dataset

Use Case: Object Contour Matting Dataset

Format: Image

Count: 50k

Annotation: Yes

X

Description: The "Object Contour Matting Dataset" is a versatile collection tailored for the e-commerce, internet, and mobile sectors, encompassing a wide range of objects like clothing, accessories, merchandise, plants, and food. This dataset focuses on contour segmentation of the main object, making it a valuable resource for applications that require precise object outline extraction.

Objects and Distractions Segmentation Dataset

Contour segmentation

Objects and Distractions Segmentation Dataset

Use Case: Objects and Distractions Segmentation Dataset

Format: Image

Count: 10.8k

Annotation: Yes

X

Description: The "Objects and Distractions Segmentation Dataset" is designed for robotics and visual entertainment sectors, featuring a range of internet-collected images with resolutions between 1365 x 2047 and 4165 x 2737 pixels. This dataset emphasizes semantic segmentation, categorizing images into five main types of interference objects, including target persons, objects, interference items, and various human body parts, facilitating the development of algorithms to distinguish between primary subjects and background distractions.

Obvious Objects Segmentation Dataset

Semantic Segmentation, Contour segmentation

Obvious Objects Segmentation Dataset

Use Case: Obvious Objects Segmentation Dataset

Format: Image

Count: 2.0k

Annotation: Yes

X

Description: The "Obvious Objects Segmentation Dataset" is a specialized collection aimed at the media and visual entertainment sectors, featuring internet-collected images all at a uniform resolution of 1536 x 2048 pixels. This dataset is dedicated to the segmentation of salient objects that are immediately noticeable and attract attention in an image, utilizing both semantic and contour segmentation techniques to define these objects at the pixel level.

Old Person and Children Contour Segmentation Dataset

Contour segmentation

Old Person and Children Contour Segmentation Dataset

Use Case: Old Person and Children Contour Segmentation Dataset

Format: Image

Count: 20.3k

Annotation: Yes

X

Description: The "Old Person and Children Contour Segmentation Dataset" is crafted for the visual entertainment sector, featuring a collection of internet-collected images with resolutions ranging from 867 x 867 to 6000 x 4000 pixels. This dataset specializes in contour segmentation, focusing on delineating the outlines of elderly individuals and children, facilitating age-specific content creation and character modeling.

Old Person and Children Contour Segmentation Dataset

Semantic Segmentation

Old Person and Children Contour Segmentation Dataset

Use Case: Sky and Person Sematic Segmentation Dataset

Format: Image

Count: 736

Annotation: Yes

X

Description: The "Sky and Person Semantic Segmentation Dataset" is developed for the visual entertainment industry, comprising internet-collected images with resolutions ranging from 1024 x 1297 to 4000 x 2667 pixels. This dataset focuses on semantic segmentation, providing annotations for 14 categories including sky, various human features like hair, face, and clothing, as well as accessories such as bags and sunglasses, offering a comprehensive resource for detailed scene and character analysis.

Outdoor Building Panoptic Segmentation Dataset

Panoptic Segmentation

Outdoor Building Panoptic Segmentation Dataset

Use Case: Outdoor Building Panoptic Segmentation Dataset

Format: Image

Count: 1k

Annotation: Yes

X

Description: The "Outdoor Building Panoptic Segmentation Dataset" is curated for the visual entertainment industry, consisting of a collection of internet-collected outdoor images with high resolutions exceeding 3024 x 4032 pixels. This dataset focuses on panoptic segmentation, capturing every identifiable instance within the outdoor scenes, including buildings, roads, people, cars, and more, providing a comprehensive dataset for detailed environmental analysis and creation.

Outdoor Multi-person Panoptic Segmentation Dataset

Panoptic Segmentation

Outdoor Multi-person Panoptic Segmentation Dataset

Use Case: Outdoor Multi-person Panoptic Segmentation Dataset

Format: Image

Count: 26k

Annotation: Yes

X

Description: The "Outdoor Multi-person Panoptic Segmentation Dataset" is tailored for the visual entertainment industry, featuring a collection of internet-collected outdoor images with resolutions ranging from 1543 x 2048 to 3072 x 2304 pixels. This dataset focuses on panoptic segmentation, encompassing multiple people and distinguishable objects such as those on individuals, buildings, vehicles, and plants. Each identifiable instance within the images is annotated, providing a comprehensive view of outdoor scenes.

Outdoor Objects Semantic Segmentation Dataset

Bounding box, Key points

Outdoor Objects Semantic Segmentation Dataset

Use Case: Outdoor Objects Semantic Segmentation Dataset

Format: Image

Count: 7.1k

Annotation: Yes

X

Description: The "Outdoor Objects Semantic Segmentation Dataset" is developed for applications in media & entertainment and robotics, consisting of a variety of internet-collected images with resolutions ranging from 1024 x 726 to 2358 x 1801 pixels. This dataset employs bounding box and key points annotations to segment various outdoor elements, including human body parts, natural scenery, architectural structures, pavements, transportation means, and more.

Panoptic Scenes Segmentation Dataset

Semantic Segmentation

Panoptic Scenes Segmentation Dataset

Use Case: Panoptic Scenes Segmentation Dataset

Format: Image

Count: 21.3k

Annotation: Yes

X

Description: The "Panoptic Scenes Segmentation Dataset" is a comprehensive resource for the robotics and visual entertainment fields, consisting of a wide range of internet-collected images with resolutions from 660 x 371 to 5472 x 3648 pixels. This dataset is aimed at semantic segmentation, capturing diverse elements such as horizontal and vertical planes, buildings, people, animals, and furniture, offering a holistic view of various scenes.

Pay Slips Dataset (Document AI)

Pay Slips Dataset (Document AI)

Use Case: OCR

Format: .jpg

Count: 2010

Annotation: No

X

Description: The Pay Slips Dataset (Document AI): Synthetic Pay Slips consists of images of artificially generated pay slips without any annotations. It features various pay slip formats and details such as employee names, salaries, and dates, used for training and testing Document AI systems in tasks like OCR and document processing.

Recording Condition: - Scanned - Web scrapper

People and Safety Belt Sematic Segmentation Dataset

Instance Segmentation, Semantic Segmentation

People and Safety Belt Sematic Segmentation Dataset

Use Case: People and Safety Belt Sematic Segmentation Dataset

Format: Image

Count: 1.5k

Annotation: Yes

X

Description: The "People and Safety Belt Semantic Segmentation Dataset" is specifically curated for industrial applications, consisting of CCTV images captured within a factory environment at a resolution of 1920 x 1080 pixels. This dataset focuses on both instance and semantic segmentation, providing annotations for people and the seat belts they are wearing, aimed at enhancing safety compliance monitoring.

Person And Clothes Semantic Segmentation Dataset

Instance Segmentation, Semantic Segmentation

Person And Clothes Semantic Segmentation Dataset

Use Case: Person And Clothes Semantic Segmentation Dataset

Format: Image

Count: 197.1k

Annotation: Yes

X

Description: The "Person And Clothes Semantic Segmentation Dataset" is designed for the e-commerce, fashion, and media & entertainment industries, featuring a diverse range of internet-collected images with resolutions spanning from 92 x 153 to 3024 x 5381 pixels. This dataset offers detailed instance and semantic segmentation of clothing items and body parts, including new categories like hats, gloves, and shoes, supporting various applications in online retail and fashion technology.

Person Home Activity Dataset

Person Home Activity Dataset

Use Case: Motion Detection, Security Surveillance

Format: mp4

Count: 10002

Annotation: No

X

Description: Type 1: videos of people immediately outside of homes at front doors - Person walks toward/past the front door/home - Person walks away from the door/home - One or more person doing an extended activity (standing, looking around, talking) 6-20ft from doorbell. Type 2: videos of people inside the home engaging in certain actions - Sitting and eating, Working at desk, Reading, Sleeping, waking up and getting out of bed, Exercising / Dancing, Falling down, lying hurt on the floor

Recording Condition: Low Light: 20% - Ambient Indoor/Outdoor Lighting - Twilight/Golden Hour Regular Light: 40% - Normal Indoor/Outdoor - Uniformly Lit - Not Overly Saturated/Harsh Bright Light: 40% - Outdoor, Mid-Afternoon, Clear Sky - Indoor Natural Light, Or Brightly Lit - Avoid Over-Saturation Or Blown-Out Scenes

Pig Contour Segmentation Dataset

Semantic Segmentation

Pig Contour Segmentation Dataset

Use Case: Pig Contour Segmentation Dataset

Format: Image

Count: 5.2k

Annotation: Yes

X

Description: The "Pig Contour Segmentation Dataset" is tailored for the animal husbandry industry, comprised of images captured from CCTV viewpoints with a high resolution of 3072 x 2048 pixels. This dataset focuses on semantic segmentation, providing detailed annotations for the contour and center points of pigs, facilitating monitoring and management in pig farming operations.

Portrait Matting Dataset

Contour Segmentation,Semantic Segmentation,Instance Segmentation

Portrait Matting Dataset

Use Case: Portrait Matting Dataset

Format: Image

Count: 29k

Annotation: Yes

X

Description: The "Portrait Matting Dataset" caters to the apparel and media & entertainment sectors, featuring a diverse collection of live screenshot images with resolutions varying from 138 × 189 to 6000 × 4000. This dataset is comprehensive, including single individuals, groups, and their accessories, and is annotated for contour, semantic, and instance segmentation tasks.

Printed Regular/Cursive Text Dataset (Document AI)

Printed Regular/Cursive Text Dataset (Document AI)

Use Case: Document AI

Format: HEIC (images) & .mov (videos)

Count: 23930

Annotation: No

X

Description: Live Photos with Handwritten text for Japanese, Korean & Russian

Recording Device: iPhone & iPad Camera

Recording Condition: - Aggressive Lighting/Glare - Camera Flash On - Colored Light - Low Light, No Camera Flash - Normal

PUBG Game Scenes Segmentation Dataset

Instance Segmentation, Semantic Segmentation

PUBG Game Scenes Segmentation Dataset

Use Case: PUBG Game Scenes Segmentation Dataset

Format: Image

Count: 11.2k

Annotation: Yes

X

Description: The "PUBG Game Scenes Segmentation Dataset" is specifically designed for gaming applications, featuring screenshots from the popular game PUBG with resolutions of 1920 × 886, 1280 × 720, and 1480 × 720 pixels. It encompasses 17 categories for instance and semantic segmentation, including characters, vehicles, landscapes, and in-game items, providing a rich resource for game development and analysis.

Pupils Segmentation Dataset

Semantic Segmentation

Pupils Segmentation Dataset

Use Case: Pupils Segmentation Dataset

Format: Image

Count: 17k

Annotation: Yes

X

Description: The "Pupils Segmentation Dataset" is tailored for applications in the beauty and media & entertainment industries, consisting of internet-collected images with resolutions varying from 90 x 89 to 419 x 419 pixels. This dataset focuses on semantic segmentation, providing subdivision annotations specifically for pupil locations to enhance detailed eye-related features in digital content.

Rail Line Labeling Dataset

Polygon, Bounding Box

Rail Line Labeling Dataset

Use Case: Rail Line Labeling Dataset

Format: Image

Count: 3k

Annotation: Yes

X

Description: The "Rail Line Labeling Dataset" is tailored for industrial applications, featuring a collection of internet-collected images with a resolution of 1920 x 1080 pixels. This dataset specializes in the detailed labeling of rail lines, including their turns and merges, using polygon annotations. Additionally, trains within these images are labeled with bounding boxes. The dataset specifically focuses on rail networks collected from Wuhan, providing a localized context for rail line analysis and train detection.

Rainy Dash Cam Video Dataset

Bounding box, Tags

Rainy Dash Cam Video Dataset

Use Case: Auto Driving

Format: Video

Count: 6.4k

Annotation: Yes

X

Description: The "Rainy Dash Cam Video Dataset" is specifically developed for autonomous driving systems to accurately function under rainy conditions, which pose significant visibility and surface traction challenges. Captured with driving recorders at resolutions exceeding 1920 x 1080 pixels and a frame rate of more than 30 fps, this dataset focuses on rainy day scenarios in urban settings, including crossroads, avenues, and paths. It features bounding boxes and tags for over 10 common urban categories such as humans, cars, electric bicycles, vans, and trucks, under the variable and often difficult lighting conditions that accompany rainy weather.

Remote Sensing Changes Detection Dataset

mask segmentation

Remote Sensing Changes Detection Dataset

Use Case: Remote Sensing Changes Detection Dataset

Format: Image

Count: 230.1k

Annotation: Yes

X

Description: The "Remote Sensing Changes Detection Dataset" is a pivotal resource for the remote sensing field, featuring internet-collected images at a uniform resolution of 1024 x 1024 pixels. This dataset is specifically annotated for mask segmentation, distinguishing between front-phase and back-phase building labels, to facilitate the detection of changes in urban and rural landscapes.

Remote Sensing Object Segmentation Dataset

Instance Segmentation, Semantic Segmentation

Remote Sensing Object Segmentation Dataset

Use Case: Remote Sensing Object Segmentation Dataset

Format: Image

Count: 210.2k

Annotation: Yes

X

Description: The "Remote Sensing Object Segmentation Dataset" is a key asset for the remote sensing field, combining images from the DOTA open dataset and additional internet sources. With resolutions ranging from 451 × 839 to 6573 × 3727 pixels for standard images and up to 25574 × 15342 pixels for uncut large images, this dataset includes diverse categories like playgrounds, vehicles, and sports courts, all annotated for instance and semantic segmentation.

Remote Sensing Scenes Segmentation Dataset

Semantic Segmentation

Remote Sensing Scenes Segmentation Dataset

Use Case: Remote Sensing Scenes Segmentation Dataset

Format: Image

Count: 100

Annotation: Yes

X

Description: The "Remote Sensing Scenes Segmentation Dataset" is a specialized collection for the remote sensing domain, comprised of high-resolution satellite images sourced from the internet, with dimensions ranging from 10752 x 10240 to 12470 x 13650 pixels. This dataset is designed for semantic segmentation, with annotations covering various natural and man-made features such as buildings, forests, water bodies, roads, and farmland.

Road Scene Semantic Segmentation Dataset

Semantic Segmentation

Road Scene Semantic Segmentation Dataset

Use Case: Road Scene Semantic Segmentation Dataset

Format: Image

Count: 2k

Annotation: Yes

X

Description: The "Road Scene Semantic Segmentation Dataset" is specifically designed for autonomous driving applications, featuring a collection of internet-collected images with a standard resolution of 1920 x 1080 pixels. This dataset is focused on semantic segmentation, aiming to accurately segment various elements of road scenes such as the sky, buildings, lane lines, pedestrians, and more, to support the development of advanced driver-assistance systems (ADAS) and autonomous vehicle technologies.

Road Scenes Panoptic Segmentation Dataset

Panoptic Segmentation

Road Scenes Panoptic Segmentation Dataset

Use Case: Road Scenes Panoptic Segmentation Dataset

Format: Image

Count: 1k

Annotation: Yes

X

Description: The "Road Scenes Panoptic Segmentation Dataset" is aimed at applications in visual entertainment and autonomous driving, featuring a collection of internet-collected road scene images with resolutions exceeding 1600 x 1200 pixels. This dataset specializes in panoptic segmentation, annotating every identifiable instance within the images, such as vehicles, roads, lane lines, vegetation, and people, providing a detailed dataset for comprehensive road scene analysis.

Satellite Components Segmentation Dataset

Semantic Segmentation, Polygon

Satellite Components Segmentation Dataset

Use Case: Satellite Components Segmentation Dataset

Format: Image

Count: 22.9k

Annotation: Yes

X

Description: The "Satellite Components Segmentation Dataset" caters to the manufacturing sector, particularly in aerospace and satellite production, featuring internet-collected images with resolutions ranging from 960 x 720 to 1537 x 1018 pixels. This dataset is aimed at semantic segmentation and polygon annotations, covering a wide array of satellite components such as sailboards, antennas, nozzles, and more, to support precision manufacturing and assembly processes.