MovieNet Scene Classification

Classify movie scenes by type, place, and cinematic attributes. Annotators label scenes with location, time of day, weather, and narrative function.

Configuration Fileconfig.yaml

yaml

# MovieNet Scene Classification Configuration
# Based on Qian et al., ECCV 2020
# Task: Classify movie scenes by various attributes

annotation_task_name: "MovieNet Scene Classification"
task_dir: "."

data_files:
  - data.json
item_properties:
  id_key: "id"
  text_key: "video_url"

output_annotation_dir: "annotation_output/"
output_annotation_format: "json"

annotation_schemes:
  - name: "place_type"
    description: "What type of place/location is this scene set in?"
    annotation_type: radio
    labels:
      - "Indoor - Home/Apartment"
      - "Indoor - Office/Workplace"
      - "Indoor - Public (restaurant, store, etc.)"
      - "Indoor - Vehicle"
      - "Outdoor - Urban/City"
      - "Outdoor - Nature/Rural"
      - "Outdoor - Road/Street"
      - "Mixed/Transitional"

  - name: "time_of_day"
    description: "What time of day is depicted?"
    annotation_type: radio
    labels:
      - "Day - Morning"
      - "Day - Afternoon"
      - "Day - Evening/Dusk"
      - "Night"
      - "Unclear/Mixed"

  - name: "weather"
    description: "What is the weather/lighting condition?"
    annotation_type: radio
    labels:
      - "Clear/Sunny"
      - "Cloudy/Overcast"
      - "Rainy"
      - "Snowy"
      - "Foggy/Misty"
      - "Indoor (N/A)"
      - "Unclear"

  - name: "scene_function"
    description: "What narrative function does this scene serve?"
    annotation_type: multiselect
    labels:
      - "Exposition (introduces information)"
      - "Action/Chase"
      - "Dialogue/Conversation"
      - "Emotional/Dramatic"
      - "Comedic"
      - "Romantic"
      - "Suspense/Tension"
      - "Flashback/Dream"
      - "Montage/Transition"

  - name: "shot_scale"
    description: "What is the predominant shot scale?"
    annotation_type: radio
    labels:
      - "Extreme Close-up"
      - "Close-up"
      - "Medium Shot"
      - "Full Shot"
      - "Long Shot"
      - "Extreme Long Shot"
      - "Mixed/Varied"

allow_all_users: true
instances_per_annotator: 50
annotation_per_instance: 2

annotation_instructions: |
  ## Movie Scene Classification Task

  Classify movie scenes by their visual and narrative attributes.

  ### Attributes to Label:

  **Place Type**: Where does the scene take place?

  **Time of Day**: When does the scene occur?

  **Weather**: What's the lighting/weather condition?

  **Scene Function**: What purpose does this scene serve narratively?

  **Shot Scale**: What's the typical camera distance?

  ### Tips:
  - Consider the OVERALL scene, not just one shot
  - Scene function can have multiple labels
  - If unsure, choose the most prominent/frequent option

Sample Datasample-data.json

json

[
  {
    "id": "movienet_001",
    "video_url": "https://example.com/videos/movie_scene_office.mp4",
    "movie": "Sample Movie",
    "scene_number": 1
  },
  {
    "id": "movienet_002",
    "video_url": "https://example.com/videos/movie_scene_outdoor.mp4",
    "movie": "Sample Movie",
    "scene_number": 2
  }
]

Get This Design

View on GitHub

Clone or download from the repository

Quick start:

git clone https://github.com/davidjurgens/potato-showcase.git
cd potato-showcase/video/scene-understanding/movienet-scene-classification
potato start config.yaml

Details

Annotation Types

multiselectradio

Domain

Computer VisionFilm Studies

Use Cases

Scene ClassificationMovie UnderstandingContent Analysis

Related Designs

ADE20K Semantic Segmentation

Comprehensive scene parsing with 150 semantic categories (Zhou et al., CVPR 2017). Annotate indoor and outdoor scenes with pixel-level labels covering objects, parts, and stuff classes.

multiselectradio

BDD100K Autonomous Driving Segmentation

Large-scale diverse driving video dataset (Yu et al., CVPR 2020). Annotate driving scenes with bounding boxes, lane markings, drivable areas, and full-frame instance segmentation.

multiselectradio

CUB-200-2011 Fine-Grained Bird Classification

Fine-grained visual categorization of 200 bird species (Wah et al., 2011). Annotate bird images with species labels, part locations, and attribute annotations.

multiselectradio

MovieNet Scene Classification

Configuration Fileconfig.yaml

Sample Datasample-data.json

Get This Design

Details

Annotation Types

Domain

Use Cases

Tags

Related Designs

ADE20K Semantic Segmentation

BDD100K Autonomous Driving Segmentation

CUB-200-2011 Fine-Grained Bird Classification