@mike Thanks for responding. 'm reading both bad and raw events. We have used lzo encoding, and it’s list of .lzo and .loz.index files. Given that Spark is reading them fine, I thought the split lzo files are handled by Spark, I assumed it’s an encoding issue.
@ablimit, the formula good + bad = raw will not necessarily work. It depends on how you count the events. There could be many events in a payload. A single bad event in the payload will result in the whole payload ending up in bad despite the fact that the good events from the payload will also end up in good.