Snowplow for retail part 5: what can we do with data when we're well established?

06 March 2019  •  Archit Goyal
We recommend you you have read the first post in this series before diving into this one to ensure you have all the context you need! Senior management love the work of the data team so far: You’re tracking site and mobile app engagement with a host of custom events You’re tracking a host of offline conversions and can stitch these to the behavior on the site or app You have brought down marketing spend...

[Tutorial] Adding what3words reverse geocoding data to Snowplow enriched events

27 February 2019  •  Dilyan Damyanov
Snowplow’s API Request Enrichment lets us add dimensions to an incoming Snowplow event using an internal or external HTTP-based API. In this tutorial, we’ll look at how you can use the enrichment to add a ‘reverse geocoding context’ by doing a lookup against what3words.com’s API. What is what3words? what3words has divided the world into a grid of 3m x 3m squares and assigned each one a unique 3 word address. (The main Snowplow London offices...

Snowplow Spotlight: Yulia Golova

27 February 2019  •  Miriam de Medwe
Yulia Golova - Support engineer based in Portugal 1. What do you do at Snowplow? I’m a Support Engineer and part of a team that provides assistance to our Snowplow users. Mostly my job involves answering technical or practical questions, monitoring our client’s batch pipelines, adjusting them, and providing guidance through recovery processes. We also try to make sure that once initial set up is complete, our users feel comfortable to explore many more additional...

Snowplow Objective-C Tracker 1.0.0 released

27 February 2019  •  Mike Hadam
We are pleased to announce a new release of the Snowplow Objective-C Tracker. Version 1.0.0 brings dependency updates, bug fixes and support for integrating the tracker into your project with Carthage. You’ll also notice our new documentation site! Read on below the fold for: New documentation site Carthage support Other changes Upgrading Documentation and help 1. New documentation site We’ve added our mobile trackers to our new documentation site. The documentation is divided into easy...

Snowplow R112 Baalbek batch pipeline reliability improvements

20 February 2019  •  Ben Fradet
Snowplow 112 Baalbek, named after the city in Eastern Lebanon, is a release focusing on reliability improvements for the batch pipeline. Please read on after the fold for: EmrEtlRunner improvements Clojure Collector improvement Redshift (and Postgres) data model improvement Upgrading Roadmap Help BlingBling10 at the English Wikipedia CC BY-SA 3.0 1. EmrEtlRunner improvements This release is focused on improving EmrEtlRunner by adding new features and to make it more robust with respect to AWS services...

Snowplow Android Tracker 1.0.0 released

20 February 2019  •  Mike Hadam
We are pleased to announce a new release of the Snowplow Android Tracker. Version 1.0.0 introduces features to alleviate blocking issues on tracker instantiation and add control over sessions. The release also improves the robustness of foreground and background detection. You’ll also notice the introduction of our new documentation site! Read on below the fold for: New documentation site Async support Share OkHttpClient instance Session transitions Updates Documentation Getting help 1. New documentation site We’ve...

Snowplow Scala Analytics SDK 0.4.0 released

13 February 2019  •  Rostyslav Zatserkovnyi
We are excited to announce the 0.4.0 release of the Snowplow Scala Analytics SDK, a library that provides tools to process and analyze Snowplow enriched events in Apache Spark, AWS Lambda, Apache Flink, Scalding, and other JVM-compatible data processing frameworks. This release reworks the JSON Event Transformer to use a new type-safe API, and introduces several other internal changes. Read on below the fold for: Event API Using the typesafe API Additional changes Upgrading Getting...

Guest post: After looking at the data of 80 tech companies- what have I learned? Part I

13 February 2019  •  Segah A Mir
This is a guest post by Segah A. Mir, Partner and Consultant at Seattle-based Caura & Co. The past five years have given me a tremendous opportunity to see firsthand the data of over 80 VC-backed tech companies. That is close to 100 teams and 300 individuals. Naturally, I’ve got to see a lot of data — very detailed information on every transaction, activity, click, and interaction. What would be expected of me now is to go...

Resolving entities with graph databases using Neo4j

13 February 2019  •  Dilyan Damyanov
In the previous post in this series, we looked at how we can model the canonical Snowplow page_view events as a graph. We identified the various entities that make up the event and assigned each dimension of the event as a property on one of those entity nodes. We then used composable schemas to piece together a JSON schema for the event, composed of the individual schemas for each node and relationship. In the meantime,...

How server-side tracking fills holes in your data and improves your analytics

05 February 2019  •  Rebecca Lane
Client side tracking: a brief history lesson At Snowplow Analytics, we fundamentally believe that getting data collection right is one of the most important steps for deriving value from data. This is often an iterative process and the data you collect and how you collect it should evolve over time as your use cases and your analytics set up evolves and matures. While collecting data client-side is universal across our customer base, we want to...