I’m testing CrateDB’s performance on some telematics data and want to set up a 3-node cluster on AWS EC2. I’m starting with a 2-node cluster, but I can’t get node discovery to work.
I set up a single node on one instance without any problems, loaded it with data, and then created a second instance from an image of that working instance. My crate.yml file looks like this:
gateway.expected_nodes: 2
gateway.recover_after_nodes: 2
network.host: _site_
discovery.seed_hosts:
- <node1_ip>:4300
- <node2_ip>:4300
cluster.initial_master_nodes:
- <node1_ip>:4300
- <node2_ip>:4300
cluster.name: crate
Here, <node1_ip> and <node2_ip> are the instances’ private IPs. All other settings are left at their defaults.
I checked the networking and can confirm with telnet and netcat that each instance can connect to the other on port 4300. Is there something else I should be doing?
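For anyone who wants to repeat the connectivity check without telnet or netcat, here is a minimal sketch in Python. The function name `can_connect` and the timeout value are illustrative assumptions, not part of CrateDB; it only tests that a TCP connection to the transport port can be opened, which is roughly what the telnet/netcat check does.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


# Example (hypothetical addresses; substitute the instances' private IPs):
#   can_connect("<node2_ip>", 4300)   # run on node 1
#   can_connect("<node1_ip>", 4300)   # run on node 2
```

A `True` result only shows that the port is reachable at the TCP level; it says nothing about whether the nodes then agree to form a cluster.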
I also tried the EC2 discovery method using the name of the security group, but that didn’t work either.
Any help would be great. I’m sure I’m doing something dumb. Note: I’ve replaced the instance IP with crate1 in the output below; the real printout contains the actual IP.
Both nodes have the following output:
ubuntu@ip-crate1:~/crate-4.0.10$ ./bin/crate
OpenJDK 64-Bit Server VM warning: Option UseConcMarkSweepGC was deprecated in version 9.0 and will likely be removed in a future release.
[2020-02-03T17:35:24,121][INFO ][o.e.e.NodeEnvironment ] [Pilatus] using [1] data paths, mounts [[/ (/dev/nvme0n1p1)]], net usable_space [455.7gb], net total_space [969.3gb], types [ext4]
[2020-02-03T17:35:24,125][INFO ][o.e.e.NodeEnvironment ] [Pilatus] heap size [15.9gb], compressed ordinary object pointers [true]
[2020-02-03T17:35:24,280][INFO ][o.e.n.Node ] [Pilatus] node name [Pilatus], node ID [lLw7phjsT1CqEhY-0PT5fg]
[2020-02-03T17:35:24,289][INFO ][o.e.n.Node ] [Pilatus] version[4.0.10], pid[10586], build[0e162d0/2019-12-10T12:41:32Z], OS[Linux/4.15.0-1057-aws/amd64], JVM[Ubuntu/OpenJDK 64-Bit Server VM/11.0.6/11.0.6+10-post-Ubuntu-1ubuntu118.04.1]
[2020-02-03T17:35:24,584][INFO ][i.c.plugin ] [Pilatus] plugins loaded: [jmx-monitoring, lang-js, enterpriseFunctions]
SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
SLF4J: Defaulting to no-operation (NOP) logger implementation
SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details.
[2020-02-03T17:35:25,892][INFO ][o.e.p.PluginsService ] [Pilatus] no modules loaded
[2020-02-03T17:35:25,898][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [crate-azure-discovery]
[2020-02-03T17:35:25,899][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [es-repository-hdfs]
[2020-02-03T17:35:25,899][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [io.crate.plugin.BlobPlugin]
[2020-02-03T17:35:25,899][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [io.crate.plugin.CrateCommonPlugin]
[2020-02-03T17:35:25,899][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [io.crate.plugin.HttpTransportPlugin]
[2020-02-03T17:35:25,899][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [io.crate.plugin.PluginLoaderPlugin]
[2020-02-03T17:35:25,900][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [io.crate.plugin.SrvPlugin]
[2020-02-03T17:35:25,900][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [io.crate.udc.plugin.UDCPlugin]
[2020-02-03T17:35:25,900][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [org.elasticsearch.analysis.common.CommonAnalysisPlugin]
[2020-02-03T17:35:25,900][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [org.elasticsearch.discovery.ec2.Ec2DiscoveryPlugin]
[2020-02-03T17:35:25,901][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [org.elasticsearch.plugin.repository.url.URLRepositoryPlugin]
[2020-02-03T17:35:25,901][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [org.elasticsearch.repositories.azure.AzureRepositoryPlugin]
[2020-02-03T17:35:25,901][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [org.elasticsearch.repositories.s3.S3RepositoryPlugin]
[2020-02-03T17:35:25,901][INFO ][o.e.p.PluginsService ] [Pilatus] loaded plugin [org.elasticsearch.transport.Netty4Plugin]
[2020-02-03T17:35:27,062][INFO ][o.e.d.DiscoveryModule ] [Pilatus] using discovery type [zen] and seed hosts providers [settings]
[2020-02-03T17:35:28,019][INFO ][psql ] [Pilatus] PSQL SSL support is disabled.
[2020-02-03T17:35:28,240][INFO ][i.c.p.PipelineRegistry ] [Pilatus] HTTP SSL support is disabled.
[2020-02-03T17:35:28,300][INFO ][o.e.n.Node ] [Pilatus] initialized
[2020-02-03T17:35:28,300][INFO ][o.e.n.Node ] [Pilatus] starting ...
[2020-02-03T17:35:28,977][INFO ][psql ] [Pilatus] publish_address {crate1:5432}, bound_addresses {crate1:5432}
[2020-02-03T17:35:29,101][INFO ][i.c.p.h.CrateNettyHttpServerTransport] [Pilatus] publish_address {crate1:4200}, bound_addresses {crate1:4200}
[2020-02-03T17:35:29,149][INFO ][o.e.t.TransportService ] [Pilatus] publish_address {crate1:4300}, bound_addresses {crate1:4300}
[2020-02-03T17:35:29,155][INFO ][o.e.b.BootstrapChecks ] [Pilatus] bound or publishing to a non-loopback address, enforcing bootstrap checks
[2020-02-03T17:35:29,313][INFO ][o.e.c.s.MasterService ] [Pilatus] elected-as-master ([1] nodes joined)[{Pilatus}{lLw7phjsT1CqEhY-0PT5fg}{_T7XdZlUSAeaCAz66nh8ug}{crate1}{crate1:4300}{http_address=crate1:4200} elect leader, BECOME_MASTER_TASK, FINISH_ELECTION], term: 24, version: 151285, reason: master node changed {previous [], current [{Pilatus}{lLw7phjsT1CqEhY-0PT5fg}{_T7XdZlUSAeaCAz66nh8ug}{crate1}{crate1:4300}{http_address=crate1:4200}]}
[2020-02-03T17:35:29,761][INFO ][o.e.c.s.ClusterApplierService] [Pilatus] master node changed {previous [], current [{Pilatus}{lLw7phjsT1CqEhY-0PT5fg}{_T7XdZlUSAeaCAz66nh8ug}{crate1}{crate1:4300}{http_address=crate1:4200}]}, term: 24, version: 151285, reason: Publication{term=24, version=151285}
[2020-02-03T17:35:29,774][INFO ][o.e.n.Node ] [Pilatus] started
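To see which nodes each instance believes are in the cluster once it has started, one option is to query the `sys.nodes` table through CrateDB’s HTTP endpoint on port 4200 (the HTTP port shown in the log above). The following is a rough sketch under stated assumptions: the helper names `build_sql_request`, `node_rows`, and `visible_nodes` are made up for illustration, and the host passed in is a placeholder for an instance’s private IP.

```python
import json
import urllib.request


def build_sql_request(host: str, stmt: str, port: int = 4200) -> urllib.request.Request:
    """Build a POST request for CrateDB's /_sql HTTP endpoint."""
    body = json.dumps({"stmt": stmt}).encode("utf-8")
    return urllib.request.Request(
        f"http://{host}:{port}/_sql",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def node_rows(response_body: bytes) -> list:
    """Extract the result rows from a /_sql JSON response as tuples."""
    payload = json.loads(response_body)
    return [tuple(row) for row in payload.get("rows", [])]


def visible_nodes(host: str, timeout: float = 5.0) -> list:
    """Return (id, name) for every node that the given instance can see."""
    req = build_sql_request(host, "SELECT id, name FROM sys.nodes ORDER BY name")
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return node_rows(resp.read())


# Example (hypothetical address): visible_nodes("<node1_ip>")
```

Running `visible_nodes` against each instance in turn and comparing the returned ids and names shows whether the two nodes report one shared cluster or two separate single-node clusters.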