Union without ALL
Condition
A query in RUNNING status contains UNION instead of UNION ALL, and it aggregates a large number of rows.
How to fix
Use UNION ALL instead of UNION. If you actually need de-duplication, consider using GROUP BY or DISTINCT on a sub-query with multiple tables combined by UNION ALL.
Process data in smaller chunks.
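The rewrite suggested above can be sketched with Python's built-in sqlite3 module. This is only an illustration, not the engine the alert actually monitors. The tables orders_2023 and orders_2024 and the customer_id column are hypothetical names invented for the example. It shows that UNION de-duplicates implicitly, UNION ALL keeps every row, and an explicit DISTINCT over a UNION ALL sub-query returns the same result as UNION.

```python
import sqlite3

# In-memory database with two hypothetical tables (names are assumptions).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders_2023 (customer_id INTEGER);
    CREATE TABLE orders_2024 (customer_id INTEGER);
    INSERT INTO orders_2023 VALUES (1), (2), (2);
    INSERT INTO orders_2024 VALUES (2), (3);
""")

# UNION: the engine de-duplicates the combined rows implicitly.
union_rows = conn.execute("""
    SELECT customer_id FROM orders_2023
    UNION
    SELECT customer_id FROM orders_2024
    ORDER BY customer_id
""").fetchall()

# UNION ALL: rows are simply concatenated, duplicates included.
union_all_rows = conn.execute("""
    SELECT customer_id FROM orders_2023
    UNION ALL
    SELECT customer_id FROM orders_2024
""").fetchall()

# Explicit de-duplication on a sub-query combined by UNION ALL.
distinct_rows = conn.execute("""
    SELECT DISTINCT customer_id
    FROM (
        SELECT customer_id FROM orders_2023
        UNION ALL
        SELECT customer_id FROM orders_2024
    ) AS combined
    ORDER BY customer_id
""").fetchall()

print(union_rows, len(union_all_rows), distinct_rows)
```

If duplicates are impossible or harmless in your data, the plain UNION ALL form is enough and skips the de-duplication step entirely.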
Example
    UnionWithoutAllCondition(
        min_input_rows=10_000_000,  # at least 10M input rows for UNION without ALL
        notice_duration=60 * 10,  # 10 minutes for notice
    ),
Specific arguments
min_input_rows (int) - how many rows should go into the Aggregate node caused by UNION (recommended min value is at least 1M to prevent false positives)