Thus the new Scala Common Enrich is a shared library for processing raw Snowplow events into validated and enriched Snowplow events. Common Enrich is designed to be used within a “host” enrichment process: initially our existing Scala Hadoop Enrich process, but it should be relatively straightforward to also embed this in a Kinesis application.
If you are using the existing Scalding-based Enrichment process, the only difference you should notice is the new composite v_etl for Snowplow events: “hadoop-0.3.6-common-0.1.0”.
Assuming you are using EmrEtlRunner, you simply need to update your configuration file, config.yml, to use the latest version of the Hadoop ETL, 0.3.6 (the Hadoop component of the v_etl shown above).
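As a rough sketch, the relevant fragment of config.yml might look like the following. The key name `:hadoop_etl_version:` and its nesting under `:snowplow:` are assumptions based on the usual EmrEtlRunner layout of this era, so check them against your own file. The version number comes from the v_etl string above.

```yaml
# Fragment of config.yml for EmrEtlRunner.
# The key name and nesting below are assumptions; match them to your existing file.
:snowplow:
  :hadoop_etl_version: 0.3.6 # Hadoop ETL version to run (from v_etl "hadoop-0.3.6-common-0.1.0")
```

Only this one value should need to change. The rest of your existing configuration can stay as it is.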
And that’s it! As always, you can find more detail on the tickets in this release under the Snowplow v0.8.12 release on GitHub.