Tick-Level Order Books: Technical Product Overview

This article provides a technical description of Kaiko’s recently launched tick-by-tick historical order book product.

Contents:

  1. L3 Data in Cryptocurrency Markets
  2. Order Book Snapshots vs. Tick-Level Order Books
  3. Kaiko’s Tick-level Order Book Data
  4. Use Cases
  5. Request Data Sample

L1 vs. L2 vs. L3 Order Book Data

Level 1: L1 data refers to the best bid / best ask of a trading pair’s order book. This data is commonly referred to as “quote” data, and is accessed in real-time. Traders subscribing to L1 quote data receive real-time changes to the best bid / best ask of an order book, which includes the price level and order amount.

Level 2: L2 data is more granular than L1 data, and includes bids and asks at all price levels of a trading pair’s order book. This data is commonly referred to as “depth of book” data. In cryptocurrency markets, market makers place limit orders at price levels that differ from the current price of an asset. L2 data includes all bids and asks placed by these market makers, aggregated by price level. For example, if 10 traders placed bids of various sizes at $8,000 while the current price of BTC was $8,500, this would appear as an L2 quote of [price: $8,000, amount: 23.4]. The example amount of 23.4 would be the sum of all orders placed by the 10 traders. Because L2 data is aggregated by price level, it can also be referred to as “market by price level” data.

Level 3: L3 data provides even deeper information than L2 data. L3 data refers to non-aggregated bids and asks placed by individual market makers. An L3 data feed would include every individual bid and ask, whereas an L2 data feed would include these bids and asks aggregated by price level. For example, L3 data for the quotes placed by the 10 traders above would result in 10 different data points with each individual order amount, rather than a single data point aggregated by price level. Thus, L3 data can be referred to as “market by order” data.

Each consecutive level of data provides an increased level of granularity. Some traders may find L1 data sufficient for their trading strategies, while others require L2 data. L3 data is the most granular level of order book data, frequently used by the most sophisticated traders.

L3 Data in Cryptocurrency Markets

Why is this the case? One reason may be the sheer volume of data. At each individual price level, there can be dozens (at times, hundreds) of individual market makers placing bids or asks. L2 data transforms these orders into a single data point while L3 data consists of raw, individual orders. Most traders find L2 data sufficient for their use case, which may be why exchanges frequently do not provide this data unaggregated. L3 data feeds can at times be unwieldy from these three exchanges, which implies that exchanges struggle to ensure stable WebSocket connections for this data type.

There is currently no strong consensus across exchanges for defining this data type. Of the three exchanges that provide an L3 data feed, each is labeled differently. Coinbase calls their L3 feed the ‘Full’ channel, Bitfinex calls their feed the ‘Raw Book’ channel, and Bitstamp calls their feed the ‘Live Orders’ channel. All three of these feeds have completely different formatting. However, the one commonality is that this data comprises non-aggregated, raw bids and asks.

At Kaiko, we hope to create a common language around L1, L2, and L3 data and define its role in cryptocurrency markets.

Tick-level Order Book Data

Since 2015, we have collected what we call “order book snapshots” from exchanges’ REST APIs. Twice per minute, we make requests to each exchange’s order book API endpoint to collect all bids and asks at a moment in time live on a trading pair’s order book. We call this data an “order book snapshot” because it comprises all bids and asks at a moment in time. The data is essentially stored as a long list of bids and asks, all with the same timestamp which refers to the moment in time when the snapshot was taken. Different timestamps correspond to different snapshots.

An example of an order book snapshot. Each row is either a bid or an ask.

Order book snapshots allow traders and researchers to replicate order books as one would view them on an exchange, but only in 30 second increments (which is the frequency that we currently take snapshots). In between the time a snapshot is taken, it is impossible to know if changes were made to orders or orders were added or removed. For liquid markets, this can be problematic because frequently hundreds, if not thousands, of changes are made every minute. In particular, high frequency traders often place limit orders that are executed within milliseconds. Order book snapshots are simply not granular enough to capture these rapid changes to an exchange’s order book.

Thus, we decided to build a tick-level order book product to ensure that our sophisticated clients have the granular data they need. Tick-level order book data is collected from exchange WebSocket feeds. Rather than make requests in 30-second increments, we simply subscribe to the order book feed and receive a constant stream of all updates made to the order book. Each data packet we receive is either a new order, deleted order, or changed order that can be “added” or “subtracted” from an order book’s current state.

A tick-level order book data set includes every order update. A different timestamp is assigned to each update to the book. Sequence IDs are also included for determining the exact order of events.

In order to maintain an internal order book, you need an order book snapshot as a base point from which to add and subtract the “ticks.” Thus, for each tick-level WebSocket feed we receive data from, we are also polling the order book REST API endpoint to collect an order book snapshot. All ticks come with either a millisecond (or microsecond) timestamp from the exchange and/or a sequence ID. These timestamps/sequence IDs are crucial for maintaining the correct sequence of events. When adding/subtracting ticks from an order book snapshot, the order in which they are processed by an exchange must be taken into account.

To summarize, order book snapshots and tick-level updates can be used in tandem to rebuild an order book at any given point in time. For example, with just one order book snapshot and all tick-level updates collected, you can replay every single change made to the order book over the time interval you are interested in.

Kaiko’s Tick-level Order Book Data

This dataset comes in two parts: one file includes all tick-level updates, and one file includes order book snapshots taken once per hour. These two datasets can be combined to rebuild historical market states.

Collection: We began collecting this data on May 21. Ticks are collected from each exchange’s L3 WebSocket channel. Order book snapshots are taken once per hour, collected from each exchange’s order book REST API endpoint.

Normalization: The data is not normalized due the large differences in formatting across exchanges. Each raw data packet collected includes details about the individual bid or ask placed by the market maker.

File Structure: Each individual data packet is stored in rows in compressed .csv files. One file is generated per day, for both tick updates (which we call “events”) and snapshots.

Delivery: Data is delivered through either AWS or GCP. Files for both “events” and “snapshots” will be delivered, as shown below. We can configure weekly delivery so that once per week, the previous week’s worth of L3 data is automatically pushed to your cloud bucket.

The file structure within an Amazon Web Services bucket.

Every exchange has different formatting, and we include examples of the diverging formats below. Because not every “event” has the same format, we do not include headers in the files. You can view details about headers on our documentation for this data type here.

Coinbase L3 Data
Bitstamp L3 Data
Bitfinex L3 Data

Use Cases for Tick-level Order Book Data

Some traders want to maintain internal order books, which can be done by applying the tick-level orders to an order book snapshot, which we provide in a companion file. Some analyses can be performed with just the raw ticks, but others will need to fully recreate order book states, which means that an original snapshot is necessary to apply all tick-level order book updates. There are simple algorithms described by exchanges for maintaining an internal order book (Coinbase’s documentation includes one here).

Request Data Sample

We are eager for any and all feedback, including any comments about our data formatting, details about your use case, additional exchanges to begin collection for, etc.

To read our original product announcement, click here. To view our data dictionary and documentation, click here.

Additional Information:

Data Dictionary: https://www.kaiko.com/pages/cryptocurrency-data-types

Full Documentation: https://docs.kaiko.com/#introduction

API Documentation: https://www.kaiko.com/pages/market-data-api

Historical Data: https://www.kaiko.com/pages/historical-data

Newsletter: http://eepurl.com/c7WUVv

Charts and Analytics: https://www.kaiko.com/pages/research-factsheet

Blog: https://blog.kaiko.com/

Twitter: https://twitter.com/KaikoData

Contact Form: https://www.kaiko.com/pages/contact-kaiko

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store