I can reproduce this problem with the steps below. This could be a bug in Crate 5.8.2.
Step 1: Create the table below.
create table uu.records (
  id long,
  uid string,
  recordid long,
  rday TIMESTAMP WITH TIME ZONE,
  recordname string,
  recorddesc string,
  primary key (rday, recordid, id)
) PARTITIONED BY (rday)
WITH (
  "translog.durability" = 'ASYNC'
);
Step 2: Load the data from the attached files using the statement below:
copy uu.records from '/tmp/sample_records/*.json';
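Before running the queries, it may help to confirm that the import is complete and visible. This is a minimal sketch, assuming the four attached files were placed in /tmp/sample_records/. The row count and partition values depend on the files, so no expected output is shown.

```sql
-- Make the rows written by COPY FROM visible to subsequent queries.
REFRESH TABLE uu.records;

-- Total number of imported rows.
SELECT count(*) FROM uu.records;

-- One row per partition created for the rday partition column.
SELECT partition_ident, "values"
FROM information_schema.table_partitions
WHERE table_schema = 'uu' AND table_name = 'records';
```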
Step 3: Once the data is imported into the table, run the queries given in the attached files. The only difference between the two queries is that one has an additional condition: or id in(156501)
I expected both queries to return the same data, but one of them returns fewer rows.
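The shape of the comparison can be sketched as follows. The predicate on recordid and its literal values are hypothetical stand-ins for the much longer conditions in the attached query files. Only the extra clause on id is taken from this report.

```sql
-- Hypothetical stand-in for the long predicate in the attached files.
SELECT count(*) FROM uu.records
WHERE recordid IN (1, 2, 3);

-- Same predicate plus the extra clause that distinguishes the two queries.
SELECT count(*) FROM uu.records
WHERE recordid IN (1, 2, 3) OR id IN (156501);
```

In this simplified form, OR can only add rows to a filter, so the second count should never be lower than the first. Comparing the two counts is a quick way to see the discrepancy without diffing the full result sets.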
Can someone confirm whether this is a bug in Crate 5.8.2? IMO it should also be reproducible on older versions.
records_0_04732dpi6spjed9k60o30c1g.json (55.3 KB)
records_1_04732dpi6spjed9k60o30c1g.json (51.8 KB)
records_2_04732dpi6spjed9k60o30c1g.json (52.6 KB)
records_3_04732dpi6spjed9k60o30c1g.json (55.6 KB)
Query_returning_incorrect_results.txt (17.6 KB)
Query_returning_correct_results.txt (17.6 KB)