Add fact parser validation, tests, and bulk fact loading with tests by ColtonPayne · Pull Request #100 · lab-v2/pyreason

ColtonPayne · 2026-01-21T18:18:05Z

Summary

This PR adds comprehensive input validation to the fact parser and implements bulk fact loading from CSV files with extensive test coverage.

Fact Parser Validation (`fact_parser.py`) (Issue #91)

- Validates that input is not empty or whitespace-only
- Validates parentheses structure and placement (Issue #89, BUG-038: Missing Parenthesis Validation - Causes Silent Data Corruption in Fact Parser)
- Enforces valid predicate naming (must start with a letter or underscore; letters, digits, and underscores are allowed after that)
- Validates component structure (no nested parentheses, colons, etc.)
- Validates that interval bounds are within the [0, 1] range and properly formatted (Issue #90, BUG-039: No Validation for Interval Bound Format - Crashes on Malformed Input)
- Validates that the interval lower bound is <= the upper bound
- Prevents double negation and negation with explicit bounds
- Provides clear, specific error messages for each validation failure
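
As a rough illustration of the checks listed above, here is a minimal sketch. It is not the code in `fact_parser.py`: the function name `check_fact_text`, the error wording, and the restriction to numeric `[l,u]` intervals are assumptions made for this example, and it does not model the rule about negation with explicit bounds.

```python
import re

# Hypothetical helper for illustration only; not PyReason's actual parser.
_PREDICATE_RE = re.compile(r"^[a-zA-Z_][a-zA-Z0-9_]*$")
_INTERVAL_RE = re.compile(r"^\[\s*([^,\[\]]+)\s*,\s*([^,\[\]]+)\s*\]$")


def check_fact_text(text):
    """Raise ValueError describing the first problem found in a fact string."""
    if text is None or not text.strip():
        raise ValueError("Fact text is empty or whitespace-only")

    # Split "pred(node):[l,u]" into the predicate part and the optional bound.
    body, sep, bound = text.strip().partition(":")
    body = body.strip()
    if body.count("(") != 1 or body.count(")") != 1 or not body.endswith(")"):
        raise ValueError(f"Malformed parentheses in '{body}'")

    predicate, _, rest = body.partition("(")
    if predicate.startswith("~~"):
        raise ValueError("Double negation is not allowed")
    name = predicate[1:] if predicate.startswith("~") else predicate
    if not _PREDICATE_RE.match(name):
        raise ValueError(f"Invalid predicate name '{name}'")

    # One component for a node fact, two for an edge fact.
    components = [c.strip() for c in rest[:-1].split(",")]
    if len(components) > 2 or any(not c for c in components):
        raise ValueError(f"Expected one or two non-empty components, got {components}")

    if sep:
        match = _INTERVAL_RE.match(bound.strip())
        if match is None:
            raise ValueError(f"Invalid interval format '{bound}'")
        try:
            lower, upper = float(match.group(1)), float(match.group(2))
        except ValueError:
            raise ValueError(f"Interval bounds must be numbers, got '{bound}'") from None
        if not (0.0 <= lower <= 1.0 and 0.0 <= upper <= 1.0):
            raise ValueError(f"Interval bounds must lie in [0, 1], got '{bound}'")
        if lower > upper:
            raise ValueError(f"Lower bound {lower} exceeds upper bound {upper}")
```

Calling `check_fact_text("pred(node):[1.5,2.0]")` raises a `ValueError` about the range, while `check_fact_text("rel(a,b):[0.2,0.8]")` returns without error.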

Bulk Fact Loading (`pyreason.py`)

- Implemented add_fact_in_bulk() function to load facts from CSV files
- Supports optional header row detection
- Handles optional columns: name, start_time, end_time, static
- Provides warnings for invalid data (malformed facts, invalid times, etc.) without crashing
- Supports multiple boolean formats for static field (True/true/1/yes, False/false/0/no)

Test Coverage

333 new lines in test_fact_parser.py:
- Tests for valid fact parsing (node/edge facts, intervals, negation, etc.)
- Tests for invalid inputs (missing parentheses, empty fields, invalid characters, etc.)
- Edge cases and boundary conditions
204 new lines in test_pyreason_file_loading.py:
- Tests for bulk fact loading from CSV
- Warning validation for invalid facts
- Tests with/without headers, various static value formats
- Error handling tests

Test Data

- Created example_facts.csv with comprehensive test scenarios including both valid and invalid facts
- Created example_facts_no_header.csv for testing CSV without headers

Implementation Notes

- All validation errors raise ValueError with descriptive messages
- Bulk loading continues processing even when individual rows fail (with warnings)
- Predicate validation regex: ^[a-zA-Z_][a-zA-Z0-9_]*$ (the ASCII subset of Python identifier rules)
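
To show what the predicate pattern accepts, here is a small sketch. The helper name `is_valid_predicate` is hypothetical, and the use of `fullmatch` is a choice made for this example rather than something the PR states.

```python
import re

# Pattern quoted from the implementation notes above.
PREDICATE_RE = re.compile(r"^[a-zA-Z_][a-zA-Z0-9_]*$")


def is_valid_predicate(name):
    """Return True if the whole string is an ASCII identifier-style name."""
    # fullmatch is used because "$" alone also matches just before a trailing newline.
    return PREDICATE_RE.fullmatch(name) is not None


for candidate in ["viewed", "_tmp1", "1pred", "pred@name", "naïve"]:
    # str.isidentifier() is printed alongside to show where Python's own rule differs.
    print(candidate, is_valid_predicate(candidate), candidate.isidentifier())
```

The last candidate is a valid Python identifier but is rejected by the pattern, since the character classes cover only ASCII letters and digits.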

🤖 Generated with Claude Code

dyumanaditya

@ColtonPayne Everything looks good except for my one comment on explicit bound inverses.

dyumanaditya · 2026-01-29T17:43:31Z

+            - `'pred@name(node)'` - invalid characters in predicate
+            - `'pred(node1,node2,node3)'` - more than 2 components
+            - `'pred(node):[1.5,2.0]'` - values out of range [0,1]
+            - `'~pred(node):[0.2,0.8]'` - negation with explicit bound


"negation with explicit bound": this should actually be allowed. The negation of an explicit bound is well defined and has an explicit formula. It is currently not supported, but it needs to be.

The formula for the inverse of a bound [l, u] is: ~[l, u] = [1-u, 1-l]
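
The formula above is small enough to sketch directly. The function name `negate_bound` and the input check are assumptions for this example, not the parser's actual code.

```python
def negate_bound(lower, upper):
    """Return the negation of the bound [lower, upper] as (1 - upper, 1 - lower)."""
    if not 0.0 <= lower <= upper <= 1.0:
        raise ValueError(f"Expected 0 <= lower <= upper <= 1, got [{lower}, {upper}]")
    return 1.0 - upper, 1.0 - lower


print(negate_bound(0.0, 0.25))  # prints (0.75, 1.0)
print(negate_bound(1.0, 1.0))   # prints (0.0, 0.0)
```

Under this reading, `~pred(node):[0.0,0.25]` would be stored as `pred(node):[0.75,1.0]`, and negating twice returns the original bound (up to floating-point rounding for values that are not exactly representable).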

kmukherji

Some more comments are inline.

Implements add_rule_from_csv() and add_rule_from_json() for bulk rule loading, following the same pattern as bulk fact loading from PR #100. Updates add_rules_from_file() with raise_errors parameter for backwards-compatible error handling. Adds comprehensive test coverage.

Resolves #117

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

kmukherji

Once these comments are addressed we can merge this branch.

kmukherji · 2026-02-10T00:59:19Z

+        if raise_errors:
+            raise ValueError(f"{item_label} {idx}: Invalid end_time '{end_time_raw}'")
+        warnings.warn(f"{item_label} {idx}: Invalid end_time '{end_time_raw}', using default value")
+        end_time = 0


I think we can resolve end_time errors by assigning end_time = start_time.
What do you think?

We are currently handling invalid values for start_time and end_time by setting them to zero if the user passes a non-integer value. This is the default value provided by the Fact constructor.

# Parse start_time
try:
    start_time = int(start_time_raw) if start_time_raw is not None and str(start_time_raw).strip() else 0

If raise_errors is false, we handle this gracefully. If raise_errors is true, we reject the invalid JSON/CSV input and force the user to correct it.

Yeah. I am talking about the specific case where start_time is a valid non-zero integer but end_time has an error. Should we give a warning and set it to start_time?
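
The fallback proposed in this thread could be sketched as follows. The helper names and the exact warning text are assumptions modelled on the snippets quoted above, and treating a missing end_time the same as an invalid one is also an assumption.

```python
import warnings


def parse_times(start_time_raw, end_time_raw, item_label="Row", idx=0, raise_errors=False):
    """Parse both times; an invalid or missing end_time falls back to start_time."""

    def _to_int(raw, field, default):
        if raw is None or not str(raw).strip():
            return default
        try:
            # Going through str() rejects floats such as 3.5 instead of truncating them.
            return int(str(raw).strip())
        except ValueError:
            if raise_errors:
                raise ValueError(f"{item_label} {idx}: Invalid {field} '{raw}'") from None
            warnings.warn(f"{item_label} {idx}: Invalid {field} '{raw}', using default value")
            return default

    start_time = _to_int(start_time_raw, "start_time", 0)
    # The default for end_time is the parsed start_time rather than 0.
    end_time = _to_int(end_time_raw, "end_time", start_time)
    return start_time, end_time
```

With this sketch, `parse_times("3", "abc")` warns and returns `(3, 3)`, so a fact with a valid non-zero start never ends before it begins.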

Completed

Add fact parser validation, tests, and bulk fact loading with tests

972b263

ColtonPayne added the "AI" (PR contains AI Generated Code) label Jan 21, 2026

ColtonPayne added 10 commits January 21, 2026 13:29

Don't hardcode default values

e498f4a

Prevent predicates from starting with a digit

0e2e395

Fix typo in MAKEFILE

cf592e2

Improve CSV loader tests

0908471

Add fact string formatting rules in docstring

5a78cea

Remove extranious f string for linter

83a2452

Fix api test file loading

ee0ea04

Add test for example with no header

f1395dc

Make invalid csv file loads raise exceptions by default

0e3db89

Upd tests

a402321

dyumanaditya self-requested a review January 29, 2026 17:39

dyumanaditya previously requested changes Jan 29, 2026

ColtonPayne added 5 commits January 30, 2026 07:18

Add support for negated interval and negated explicit true/false

d1cb309

Load facts from json instead of csv

b903281

Update file loading tests

cad2d95

Revert

3eecf32

Final cleanup

05b3748

kmukherji reviewed Feb 2, 2026

Comment thread pyreason/pyreason.py

kmukherji reviewed Feb 2, 2026

Comment thread pyreason/pyreason.py Outdated

kmukherji reviewed Feb 2, 2026

Comment thread pyreason/pyreason.py Outdated

kmukherji reviewed Feb 2, 2026

Comment thread tests/api_tests/test_files/example_facts.json

kmukherji requested changes Feb 2, 2026

Comment thread pyreason/pyreason.py Outdated

Comment thread pyreason/scripts/utils/fact_parser.py

ColtonPayne and others added 2 commits February 2, 2026 19:35

Add back csv file loading and add duplicate name checks

0c79988

Merge branch 'main' into input-validation

b1ea06c

ColtonPayne requested review from dyumanaditya and kmukherji February 4, 2026 13:35

ColtonPayne added 2 commits February 4, 2026 08:38

CSV Formatting

01a20a4

Add back load rules from file

42d54e2

ColtonPayne added 2 commits February 4, 2026 08:40

Merge branch 'input-validation' of github.com:lab-v2/pyreason into input-validation

f4fbcb0

Requrie exact header match for csv headers

3ba07ec

ColtonPayne assigned ColtonPayne and kmukherji and unassigned ColtonPayne Feb 4, 2026

ColtonPayne added the "Ready for Review" (Awaiting PR Review) label Feb 4, 2026

ColtonPayne mentioned this pull request Feb 6, 2026

Add bulk rule loading from CSV and JSON files #120

Merged

3 tasks

kmukherji reviewed Feb 10, 2026

kmukherji added the "Changes requested" label and removed the "Ready for Review" (Awaiting PR Review) label Feb 10, 2026

ColtonPayne and others added 5 commits February 10, 2026 09:44

Update examples and remove numeric string fact loading

930c32b

Merge branch 'main' into input-validation

4928d98

Add back static string bulk csv

e14eb9f

Merge branch 'input-validation' of github.com:lab-v2/pyreason into input-validation

8fa0ef7

Merge branch 'main' into input-validation

27add9f

kmukherji approved these changes Feb 10, 2026

ColtonPayne added 2 commits February 10, 2026 15:38

Set default end_time to start_time

b975b08

Merge branch 'input-validation' of github.com:lab-v2/pyreason into input-validation

a6fb069

ColtonPayne removed the Changes requested label Feb 10, 2026

ColtonPayne merged commit 7594583 into main Feb 10, 2026
3 checks passed

ColtonPayne deleted the input-validation branch March 18, 2026 20:19

3 participants
