Tarot Spread Recognition

Building an AI-powered system to recognize Tarot card spreads and generate textual interpretations.

Objective: Developing a model that detects a user’s 5-card Tarot spread from an image and converts it into structured text

Highlights of this experiment:

  1. Image Processing: Detect and classify each Tarot card in the spread.

  2. OCR: If the image has text (like card names), extract it.

  3. Layout Analysis: Recognize the card positions and their spread meaning.

  4. Convert to Text: Generate a structured textual output describing the spread.

Two methods in implementing this:

  1. Leverage pre-trained OCR tools:

    Problem: The image must be cropped before hand. This means that a box window representing the card positioning is expected so image has to relative to a fixed angle of shot to be usable.

  2. Building CNN for Object Detection: This method considers building object detection model using CNN layers. The objective here is to approximate the box window coordinates (top left corner, top right corner, bottom left corner, bottom right corner) for each positioning card.

  3. Combining the first 2 methods where I rely on the method 1 as the base model in application. Over time, the CNN model is used for adapting to user feedback / corrections.

Last updated