Pattern Recognition System - Quick Start Guide

Overview

This project implements a DTW (Dynamic Time Warping) + KNN pattern recognition system for identifying chart patterns in OHLCV financial data.

Installation

Install Dependencies

source venv/bin/activate
pip install -r requirements.txt

Running the Application

source venv/bin/activate
streamlit run Main_App.py

User Workflow

Step 1: Download Data

Navigate to Data Manager
Download OHLCV data for your desired currency pairs and timeframes
Data is stored in data/parquet/

Step 2: Label Patterns

Navigate to Pattern Manager > Label Patterns
Select a data file from the dropdown
Use the controls to select a pattern:
- Pattern Length: Number of bars to include (default: 50)
- Start Index: Where the pattern begins
- Pattern Label: Name for this pattern (e.g., "head_and_shoulders", "double_top")
Preview the pattern in the candlestick chart
Optionally view the normalized pattern (DDTW representation)
Click Save Pattern to add to library

Tips:

Start with clear, well-formed patterns
Use consistent naming (e.g., "bullish_flag", "bearish_wedge")
Aim for 20-50 examples per pattern type
Patterns are automatically saved to data/patterns/library.pkl

Step 3: Augment & Build Index

Navigate to Pattern Manager > View Library
Click Augment Library (Mirror Patterns)
- Automatically creates inverted versions (bullish → bearish)
- Doubles your training set
Click Build Index (LB_Keogh)
- Builds fast search index for pattern matching
- Required before scanning

Step 4: Train & Validate (NEW)

Navigate to Pattern Manager > Train & Validate
View library statistics and pattern distribution
Click Run Cross-Validation
- Set minimum confidence threshold (0.7 recommended)
- Check accuracy, precision, recall, F1 score
Click Generate Confusion Matrix
- See which patterns are confused with each other
Click Test Multiple Thresholds
- Find optimal confidence threshold
- View precision-recall tradeoff chart
- Use recommended threshold for scanning

Tips:

Aim for >70% accuracy before scanning
If accuracy is low, add more training examples
Use recommended threshold from testing

Step 5: Scan for Patterns (NEW)

Navigate to Pattern Scanner
Select data file to scan
Configure settings:
- Window Size: Match your training patterns (e.g., 50 bars)
- Min Confidence: Use threshold from training (e.g., 0.75)
- Step Size: 5 for balanced speed/thoroughness
(Optional) Filter by specific pattern labels
Click 🔍 Scan for Patterns
Review detected patterns:
- View statistics (total, unique patterns, avg confidence)
- Inspect individual detections with charts
- Analyze price action and timestamps

Step 6: Browse Library

In the View Library tab, you can:

View statistics (total patterns, unique labels, augmented count)
Filter patterns by label
Inspect each pattern (charts, metadata, quality score)
Delete unwanted patterns

Pattern Recognition Features

DTW (Dynamic Time Warping)

Derivative DTW (DDTW): Matches patterns based on shape, not absolute price
Sakoe-Chiba Constraint: Limits warping for robust matching
LB_Keogh Filtering: Fast pre-filtering for real-time performance

KNN Classification

Distance-weighted voting (closer patterns have more influence)
Configurable k (number of neighbors, default: 5)

Confidence Scoring

Multi-metric confidence calculation based on:

Closeness (35%): How close is the nearest match?
Consensus (30%): Do neighbors agree on the label?
Separation (20%): How different is the next-best label?
Quality (15%): Quality score of matching templates

Configuration

Edit config/pattern_config.yaml to customize:

dtw:
  variant: "derivative"        # "derivative" or "standard"
  constraint: "sakoe_chiba"    # "sakoe_chiba", "adtw", or "none"
  sakoe_chiba_window: 0.15     # Window size (15% of sequence)

knn:
  k: 5                         # Number of neighbors

confidence:
  min_threshold: 0.7           # Default confidence threshold

Testing

Run the test suite:

source venv/bin/activate
python -m pytest tests/ -v

Project Structure

chart_pattern_prototype/
├── Main_App.py                 # Streamlit entry point
├── pages/
│   ├── data_manager_parent.py
│   ├── analysis_parent.py
│   └── pattern_manager_parent.py  # NEW: Pattern management
├── components/
│   ├── tab_label_patterns.py      # NEW: Pattern labeling UI
│   └── tab_view_library.py        # NEW: Library viewer UI
├── engine/                        # NEW: Core pattern recognition
│   ├── preprocessor.py
│   ├── dtw_core.py
│   ├── pattern_library.py
│   ├── pattern_matcher.py
│   └── confidence.py
├── models/                        # NEW: Data models
│   ├── pattern.py
│   └── match_result.py
├── config/
│   └── pattern_config.yaml        # NEW: Configuration
├── data/
│   ├── parquet/                   # OHLCV data
│   └── patterns/                  # NEW: Pattern storage
└── tests/                         # NEW: Test suite

Next Steps

After completing Phases 1-4, you can extend the system with:

Training & Backtesting UI (Phase 5)
- Cross-validation metrics
- Hyperparameter tuning
- Precision/recall curves
Real-time Pattern Scanner (Phase 6)
- Live pattern detection
- Multi-symbol monitoring
- Alert system
Advanced Features (Phase 7)
- Multi-dimensional OHLC matching
- Proximity Forest ensemble
- Pattern discovery via clustering

Troubleshooting

Issue: Pattern library not loading

Solution: Make sure you've saved at least one pattern first

Issue: Tests failing

Solution: Make sure all dependencies are installed: pip install -r requirements.txt

Issue: Streamlit error on startup

Solution: Check that you're in the project root directory and virtual environment is activated

Technical Details

DTW Library: Uses aeon (state-of-the-art time series analysis)
Normalization: Z-score normalization for scale-invariance
Storage: Pickle-based persistence for pattern library
UI Framework: Streamlit with Plotly charts

Support

For issues or questions:

Check DESIGN_DTW_KNN.md for design details
Check IMPLEMENTATION_SUMMARY.md for implementation status
Review test files for usage examples

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pattern Recognition System - Quick Start Guide

Overview

Installation

Running the Application

User Workflow

Step 1: Download Data

Step 2: Label Patterns

Step 3: Augment & Build Index

Step 4: Train & Validate (NEW)

Step 5: Scan for Patterns (NEW)

Step 6: Browse Library

Pattern Recognition Features

DTW (Dynamic Time Warping)

KNN Classification

Confidence Scoring

Configuration

Testing

Project Structure

Next Steps

Troubleshooting

Technical Details

Support

FilesExpand file tree

QUICKSTART.md

Latest commit

History

QUICKSTART.md

File metadata and controls

Pattern Recognition System - Quick Start Guide

Overview

Installation

Running the Application

User Workflow

Step 1: Download Data

Step 2: Label Patterns

Step 3: Augment & Build Index

Step 4: Train & Validate (NEW)

Step 5: Scan for Patterns (NEW)

Step 6: Browse Library

Pattern Recognition Features

DTW (Dynamic Time Warping)

KNN Classification

Confidence Scoring

Configuration

Testing

Project Structure

Next Steps

Troubleshooting

Technical Details

Support