Search⌘ K
AI Features

Avro: Intro

Explore the Avro data serialization system, focusing on its key features like embedded JSON schemas and schema-less file reading. Understand how to define schemas, create Avro files, and read records using GenericRecord. Gain practical experience with Avro tools for schema extraction and efficient data handling in Hadoop.

We'll cover the following...

Avro

Avro is a data serialization system that allows for remote procedure calls and the fast and compact serialization of data. The defining feature of Avro is a schema always embedded within an avro file. It allows one to read the file without knowing the schema before-hand. The name Avro is borrowed from a defunct British aircraft manufacturer.

A producer that writes records to an avro file must specify the schema that describes the structure of each record. That avro schema is expressed in JSON. However, a higher abstraction, the Avro IDL language, lets developers specify schemas in a ...