Internal Schema Registry : Everything is marked as bad. It doesn’t produce any error messages that we are aware of, where would we find those? We know it is marked as bad by reading the
./logs/enrich.log
file.
When data fails validation it’s not lost, it goes to the bad events stream. Depending on what you’ve set up, you commonly you would use Elasticsearch or Athena to debug.
Each bad row will have one or more error message, which will give you an indication as to why the data failed validation. Taking a look at those will likely make the whole setup process easier for you.
Contexts : I have found this documentation which says to format it above as an object with an array of self describing objects and this documentation which states it needs to be an array of JSON (similar to what you said) gives an example in the third code block. Which one is right? I believe they are conflicting, unless I misunderstand their context and use case.
Both do agree as I read it, but I can see where you’re coming from - the first states that each individual context should be a self-describing JSON, then gives a description of the output, which is a slightly different format. The second example is probably the better one to follow since it gives the specific tracking code, rather than the first which is more about definitions.