hangout 版本 1.3.1 kafka 0.10

kafka 故障后,hangout 报错

I0418 01:33:27.948472       1 bulk_http.go:144] bulk 15457 docs with execution_id 45326
I0418 01:33:27.948537       1 bulk_http.go:154] try to bulk with host (http://10.191.68.23:9200/_bulk)
I0418 01:33:28.980086       1 bulk_http.go:161] bulk done with execution_id 45326 1.032 15457 14977.712
I0418 01:37:27.950759       1 brokers.go:131] got 25 brokers
E0418 01:37:29.849768       1 simple_consumer.go:371] fetch error:read tcp 10.191.66.203:43008->10.191.54.224:9092: i/o timeout
E0418 01:37:29.849808       1 fetch_response.go:284] could not read enough bytes(4) to get fetchresponse length. read 0 bytes
I0418 01:37:29.949877       1 broker.go:177] broker  se.net:9092 is dead, (re)open it after sleep 200ms
E0418 01:38:00.651407       1 simple_consumer.go:371] fetch error:read tcp 10.191.66.203:48402->10.191.54.224:9092: i/o timeout
E0418 01:38:00.651467       1 fetch_response.go:284] could not read enough bytes(4) to get fetchresponse length. read 0 bytes
I0418 01:38:00.751551       1 broker.go:177] broker nease.net:9092 is dead, (re)open it after sleep 200ms
E0418 01:38:31.452661       1 simple_consumer.go:371] fetch error:read tcp 10.191.66.203:48446->10.191.54.224:9092: i/o timeout
E0418 01:38:31.452716       1 fetch_response.go:284] could not read enough bytes(4) to get fetchresponse length. read 0 bytes
I0418 01:38:31.552793       1 broker.go:177] broker .net:9092 is dead, (re)open it after sleep 200ms
E0418 01:39:02.285946       1 simple_consumer.go:371] fetch error:read tcp 10.191.66.203:48506->10.191.54.224:9092: i/o timeout
E0418 01:39:02.285993       1 fetch_response.go:284] could not read enough bytes(4) to get fetchresponse length. read 0 bytes
I0418 01:39:02.386042       1 broker.go:177] broker ease.net:9092 is dead, (re)open it after sleep 200ms
E0418 01:39:33.138383       1 simple_consumer.go:371] fetch error:read tcp 10.191.66.203:48588->10.191.54.224:9092: i/o timeout
E0418 01:39:33.138415       1 fetch_response.go:284] could not read enough bytes(4) to get fetchresponse length. read 0 bytes
I0418 01:39:33.238536       1 broker.go:177] broker i.nease.net:9092 is dead, (re)open it after sleep 200ms
E0418 01:40:03.945554       1 simple_consumer.go:371] fetch error:read tcp 10.191.66.203:48686->10.191.54.224:9092: i/o timeout
E0418 01:40:03.945592       1 fetch_response.go:284] could not read enough bytes(4) to get fetchresponse length. read 0 bytes
I0418 01:40:04.045787       1 broker.go:177] broker .i.nease.net:9092 is dead, (re)open it after sleep 200ms
省略重复
E0418 04:37:00.279222       1 brokers.go:269] get metadata of [suez_g66_logtail] from canal-sssez.i.nease.net:9092 error: There is no leader for this topic-partition as we are in the middle of a leadership election.
E0418 04:37:00.323861       1 brokers.go:269] get metadata of [suez_g66_logtail] from canal-kafssssase.net:9092 error: There is no leader for this topic-partition as we are in the middle of a leadership election.
E0418 04:37:00.387194       1 brokers.go:269] get metadata of [suez_g66_logtail] from canal-sss9-sssease.net:9092 error: There is no leader for this topic-partition as we are in the middle of a leadership election.
省略重复
E0418 06:28:36.474490       1 brokers.go:269] get metadata of [suez_g66_logtail] from canal-kafka14-suez.i.nease.net:9092 error: There is no leader for this topic-partition as we are in the middle of a leadership election.
E0418 06:28:36.675459       1 brokers.go:269] get metadata of [suez_g66_logtail] from canal-kafka10-suez.i.nease.net:9092 error: There is no leader for this topic-partition as we are in the middle of a leadership election.
kafka 故障恢复后,hangout 一直不消费,刷这个,强制重启后开始消费
I0418 06:28:36.876990       1 group_consumer.go:235] sync logstash_g66_logtail_11 error: The coordinator is not aware of this member.
I0418 06:28:36.877011       1 group_consumer.go:169] try to join group logstash_g66_logtail_11
I0418 06:28:36.877333       1 group_consumer.go:190] join logstash_g66_logtail_11 error: The coordinator is not aware of this member.
省略重复
I0419 00:00:00.001397       1 group_consumer.go:190] join logstash_g66_logtail_11 error: The coordinator is not aware of this member.
I0419 00:00:00.001403       1 group_consumer.go:169] try to join group logstash_g66_logtail_11

补充下时间点,kafka 是18日凌晨1点30全挂了,早上8点恢复,但是这个不消费是 19日22点发现并重启的,数据分区没有rotate,但是从最新读了,而且后面补数据重新指定offset 是可以消费的。