@dashirov two great ideas!
For a dashboard, I think the two most promising platforms are QuickSight and Redash, both of which support Athena. I hope to get a chance to build one in each and share the results here shortly.
In terms of tutorials for recovering bad rows, there are a couple:
- Using Hadoop Event Recovery to recover events with a missing schema
- The Hadoop Event Recovery wiki page includes a couple of examples.
If there’s a specific type of bad row you’re looking to recover and you’re stuck or have questions, do open a thread with the relevant details. I’m keen that we start publishing the different JS recovery functions that we use, as I suspect a handful of error types (and their corresponding JS recovery functions) account for more than 80% of the events people want to recover.