Making Snowplow schemas flexible – our technical approach


In the last couple of months we’ve been doing an enormous amount of work to make the core Snowplow schema flexible. This is an essential step towards making Snowplow an event analytics platform that can be used to store event data from:

  1. Any kind of application. The event dictionary, and therefore the schema, for a massively multiplayer online game will look totally different to that of a newspaper site, which in turn will look different to that of a banking application.
  2. Any kind of connected device. The types of events you get from a SmartMeter will be different to those from a mobile phone.

Making schemas flexible is not enough, however. We also need to make it possible for a business to evolve its schema over time as, for example, its website, apps and products evolve. This is also essential if we are to load the data into structured data stores for easy querying.

The presentation below was given by Alex at the Budapest Big Data Meetup last night. I thought it would be useful to share it with the wider Snowplow community, so that you all have a better idea of what we plan to launch in the next few weeks, and some of the thinking behind it.

Slides: "Big Data Meetup Budapest: adding data schemas to Snowplow", from yalisassoon

Interested in warehousing your event data?

Then get in touch with the Snowplow team.