Enable and configure your enrichments directly in the Snowplow UI

09 September 2019  •  Lyuba Golovina
Overview Enrichments are one of Snowplow’s most powerful features. Not all the data that describes an event is available at the place that event is tracked. Enrichments make it possible to enhance your dataset with additional details by adding first and third-party data points to each event, in real-time, further down the data collection pipeline. For example, imagine you are an e-commerce site selling a green dress internationally. The same green dress will be sold...

Snowplow Snowflake Loader 0.5.0 released

02 September 2019  •  Enes Aldemir
We are pleased to announce version 0.5.0 of the Snowplow Snowflake Loader! This release introduces bad row support with its new format. Read on below the fold for: Bad row support Upgrading Getting help 1. Bad Row Support Data quality has always been a huge focus for Snowplow. Being a non-lossy data pipeline is one of the central pieces of this data quality puzzle. Indeed, other pieces such as data validity (through our schema validation...

How Animoto uses event tracking data to understand and optimize the user journey

02 September 2019  •  Alex Beskin & Jason Bellinger
“How are people using my site?” “What is a typical customer’s journey?” “How many times does a user visit before purchasing? And how many pages do they visit? Which one had the biggest impact on their decision?” These are a few questions that data analysts hear all the time from marketing, product and finance teams. The concepts of a “customer journey”, “clickstream analysis”, and “multi-touch attribution” have been around for a long time and are...

Mapping the customer journey with complete-picture data to reach the single customer view

29 August 2019  •  Erika Wolfe
With data unification - the ability to join multiple data sources and get a full-picture view - travel companies are harnessing the power of understanding their customer journey at a more granular level, finally tapping into insight untethered from previously siloed sources. This creates conditions for “hyper-relevance” as a differentiator in a crowded travel market dominated by big, household names. Data joined together from across the customer journey - whether from web and mobile, from...

Snowplow RDB Loader R31 released

27 August 2019  •  Anton Parkhomenko
We are pleased to announce the release of Snowplow RDB Loader R31. This updates the format of bad rows emitted to a new format, as part of a broader piece of work we have been undertaking to improve Snowplow’s capabilities around bad row handling, debugging and recovery; and fixes a data quality issue with synthetic duplicates in the RDB Shredder. Please read on after the fold for: New bad rows format Synthetic duplicates issue Other...

Mental health in the workplace: how Snowplow supports mental well-being

27 August 2019  •  Nicki Faulkner
Sometimes it can be really obvious when we should take a sick day from work: broken bones, food poisoning, migraines, contagious coughs and sneezes. However, when we have poor mental health, at which point are we too unwell to come to work? We’re all guilty of thinking “I should go into work today, I could definitely be feeling worse!” but at the same time, we could also be feeling better and maybe it’s at this...

Iglu Scala Client 0.6.0 released

09 August 2019  •  Anton Parkhomenko
We’re tremendously excited to announce the new 0.6.0 release of the Iglu Scala Client, a library in charge of schema resolution and data validation in all Snowplow components, including enrichment jobs and loaders. This release brings enormous amount of API changes we’ve made in order to facilitate implementation of Snowplow Platform Improvement Proposals, including new bad rows format, Amazon Redshift automigrations and deprecation of a batch pipeline. In the rest of this post we will...

Time spent is the most important metric for media, here’s how to get it right

07 August 2019  •  Simon Rumble
The actual product media companies sell is the engagement and attention of their audience yet the way it’s commonly measured is completely broken. Let’s look at the problem of measuring attention, ways to solve it and explore some examples of media companies doing it well. Media companies trade in the attention of audiences. Audiences visit them—and hopefully pay them—to inform and entertain. Advertisers pay them to get the attention of those audiences. If there’s one...

Create and update your event and entity schemas quickly and easily in the Snowplow UI

29 July 2019  •  Yali Sassoon
Schemas are one of the most powerful Snowplow Insights features. To date they have been hard to create and update. Now this can be done easily, directly in the Snowplow UI. Overview One of the most powerful features in Snowplow are schemas. Schemas make it possible for: Each Snowplow user to define their own events and entities, so that their Snowplow data presents a clear and easy-to-understand record of what has happened. Each Snowplow user...

How to ensure your data collection evolves alongside your business

23 July 2019  •  Lyuba Golovina
Modern businesses evolve quickly. While twenty years ago it was common to update a website every few years, now modern digital businesses make changes to their websites and applications weekly, or even daily. At the same time, most data-driven businesses would agree that collecting data is an essential part of optimizing the user experience. This creates a data collection challenge: the faster companies evolve their websites and apps, the harder it becomes to reliably collect...