Making it easy to work with event-level mobile data

23 May 2019  •  Yali Sassoon
In the last month we released new versions of the Snowplow iOS and Android SDKs. These new versions deliver a data set that is significantly easier to model and work with: helping companies use mobile data to better inform product development and marketing decisions. In this post, we’ll explain how the data delivered by these new mobile trackers is both easier to work with and better enables analysts, data scientists, marketers and product managers to...

Snowplow R114 released with new adapter, enrichment and other improvements

20 May 2019  •  Benjamin Benoist
We are pleased to release Snowplow 114 Polonnaruwa, named after the ancient city of Polonnaruwa in Sri Lanka. This Snowplow release includes a number of new features and updates, most of which live in Scala Common Enrich: New enrichment: YAUAA (Yet Another UserAgent Analyzer) New feature: remote HTTP adapter New tutorial: add an enrichment to the pipeline Other improvements Updates for EmrEtlRunner Upgrading Roadmap Getting help 1. New enrichment: YAUAA (Yet Another UserAgent Analyzer) Understanding...

Snowplow Obj-C Tracker 1.1.0 released

06 May 2019  •  Mike Hadam
We are pleased to announce a new release of the Snowplow Obj-C Tracker. Version 1.1.0 introduces new tracking features including new screen view events and screen entities (with the option to enable automatic tracking), and automatically tracked install and crash events. With these new features, Snowplow users can set up comprehensive tracking much faster enabling users to get started with mobile tracking in Snowplow quickly and easily. Read on below the fold for: Screen tracking...

Snowplow Android Tracker 1.1.0 released

06 May 2019  •  Mike Hadam
We are pleased to announce a new release of the Snowplow Android Tracker. Version 1.1.0 introduces new auto-tracking features for Snowplow mobile users including screen events and entities, install events and application context. With these new features, Snowplow users can set up comprehensive tracking much faster so your data team can gain access to valuable insights about your application within minutes. Read on below the fold for: Screen tracking Install tracking Application context Updates Documentation...

Data Science Festival: What makes an effective data team?

24 April 2019  •  Alex Dean
It was great to be able to speak at the Data Science Festival in London on April 13. DSF is an annual, week long celebration of data science that culminates in a one-day main event. It’s a place for current and future data scientists to meet, discuss challenges and opportunities and network with fellow data enthusiasts. The original topic of my talk was “Why high quality data is crucial for your machine learning models.” but...

Data Science Festival: Machine learning in real-time: the next frontier

24 April 2019  •  Alex Dean
In addition to my main talk at Data Science Festival on What makes an effective data team, I was lucky to give one of the 10 minute “Lightning talks” in the SHIFT room before lunch as well. This post briefly recaps my lightning talk on machine learning in real-time, before sharing my conference highlights and some closing thoughts. Machine learning in real-time: the next frontier For my Lightning talk I discussed “Machine learning in real-time:...

Snowplow Spotlight Benjamin Benoist

11 April 2019  •  Miriam de Medwe
Benjamin Benoist - Data Engineer based in Berlin What do you do at Snowplow? I joined Snowplow 6 months ago as a Data Engineer. During the first few months, I was working mainly for a customer to build a real-time application on top of Snowplow data. I’m now getting more involved in developments around the Snowplow pipeline. I’m currently adding my first enrichment and I’m also integrating contributions from the open source community. Why did...

Using AWS Glue and AWS Athena with Snowplow data

04 April 2019  •  Konstantinos Servis
This is a guide to interacting with Snowplow enriched events in Amazon S3 with AWS Glue. The objective is to open new possibilities in using Snowplow event data via AWS Glue, and how to use the schemas created in AWS Athena and/or AWS Redshift Spectrum. This guide consists of the following sections: Why analyze Snowplow enriched events in S3? AWS Glue prerequisites Creating the source table in Glue Data Catalog Optionally format shift to Parquet...

Guest post: 3 reasons why your company should own its data

04 April 2019  •  Jacob Thomas
This is a guest post by Jacob Thomas, Lead Data Engineer at CarGurus. You can find the original article and read more from Jacob on Bostata. When it comes to your company’s software and infrastructure, it often makes sense to buy vs. build. However, it will benefit you in the long-term to thoroughly understand and own your data management and collection. Here’s why. When it comes to software and related infrastructure, businesses get caught in...

How many of your visitors block your Snowplow tracking?

18 March 2019  •  Mike Nemirovsky
tl;dr As a company that focuses on helping businesses collect data in order to better serve their customers, we inevitably get asked about what happens when those customers don’t want to be tracked. With the usage of Ad-blockers, and in particular privacy filters, on the rise, some of our customers are seeing the effect on their data. This effect is at times perceived as a problem or a threat towards the quality of data collection....