Create a utility rate_gold function #3

@mbernst

I'm thinking about ways to enable traditional data capture workflows. Gold standards are a super common option. What if we created another utility function, rate_gold? It would take a dict mapping input -> correct answer as an argument. The utility function would check whether any task inputs match an input in that dict and, if so, whether the response matches the gold answer. It would then take sum(correct) and send that to the rate() function.
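A rough sketch of what this could look like, to make the idea concrete. Everything about the data shapes here is an assumption: the issue does not say how task results are represented or what rate() accepts, so the sketch uses a hypothetical list of dicts with `worker_id`, `input`, and `response` keys, and takes the rating function as a `rate` callback that receives a worker -> score dict. Inputs are assumed to be hashable so they can be looked up in the gold dict.

```python
from collections import defaultdict
from typing import Any, Callable, Dict, Hashable, Iterable, Mapping


def score_gold(
    gold: Mapping[Hashable, Any],
    results: Iterable[Mapping[str, Any]],
) -> Dict[Any, int]:
    """Count, per worker, how many gold inputs were answered correctly.

    `results` is a hypothetical shape: each item has "worker_id",
    "input", and "response" keys. Adapt to the real task result format.
    """
    correct: Dict[Any, int] = defaultdict(int)
    for result in results:
        task_input = result["input"]
        if task_input not in gold:
            continue  # not a gold task, so it does not affect the score
        # Adding 0 for a wrong answer still registers the worker,
        # so workers who missed every gold task get a score of 0.
        correct[result["worker_id"]] += int(result["response"] == gold[task_input])
    return dict(correct)


def rate_gold(
    gold: Mapping[Hashable, Any],
    results: Iterable[Mapping[str, Any]],
    rate: Callable[[Dict[Any, int]], None],
) -> Dict[Any, int]:
    """Score workers against the gold answers and pass the scores to `rate`.

    `rate` stands in for the existing rate() function; its real
    signature is not given in this issue, so it is injected here.
    """
    scores = score_gold(gold, results)
    rate(scores)
    return scores
```

Workers who never saw a gold input are left out of the scores entirely, rather than being rated 0, since there is nothing to judge them on. Whether raw counts or a fraction correct should go to rate() is an open design question.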
