avoid/reduce DISTINCT calls

In `bulk_updater` we do a DISTINCT on every query in question. This is needed to avoid updating 1:n relations over and over.

Issues with this:
- DISTINCT is placed there unconditionally, but in fact is only needed for back relations
- DISTINCT is a known perf smell for queries
- does not work with already sliced/limited querysets on mysql (#99 already applies a pk extraction hack for that)

Ideas for better handling:
- To apply the distinct reduction only on back relations, changes to the graph output and `_querysets_for_update` are needed with relation type backtracking, also m2m-through handling is affected by this.
- Depending on the cardinality/selectivity of a back relation set, an explicit pk extraction in python might perform better than query trickery. Problem here: we dont know the selectivity upfront, thus have to query and eval the pk set, which is costly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

avoid/reduce DISTINCT calls #101

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

avoid/reduce DISTINCT calls #101

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions