What would be the basic optimization techniques/parameters for a time series table in this example?

  • Typically one should aim for a single shard size of about 10-50 GiB
  • Partitions can be deleted/dropped like a table. i.e. if you want to delete old data quickly, you should partition by e.g. month
  • typically having less shards is more performant, than having too many

@jayeff wrote a basic guide for time series data

also some core information on sharding:

2 Likes