It was great to have the opportunity to talk at London NoSQL earlier this week on Snowplow’s journey from NoSQL to SQL, and then back to a hybrid model supporting multiple storage targets. Many thanks to Couchbase developer evangelist Matthew Revell for inviting me!
My talk took us through Snowplow’s journey from using NoSQL (via Amazon S3 and Hive), to SQL storage (via columnar Amazon Redshift, and PostgreSQL), and most recently to a mixed model of NoSQL and SQL, including S3, Redshift and Elasticsearch. Preparing for the talk was also a great opportunity for me to think through and write down how Snowplow’s data storage has evolved over the last two years.
My slides are here:
It was a great audience, who asked some excellent technical questions about Snowplow, data schemas and event analytics following the talk. I feel like we’re on the cusp of a lot of interesting new developments around event and entity storage at Snowplow - I look forward to revisiting this topic in 2015 and seeing how much has changed!