Neptune Database Fundamentals

Explore how Amazon Neptune enables management of property graph and RDF data through a scalable clustered architecture. Learn about its compute and storage layers, endpoint routing, failover processes, security with VPC and IAM, continuous backups, bulk data loading from S3, and supported query languages. This lesson helps you understand Neptune's operational fundamentals essential for designing resilient and secure graph workloads on AWS.

We'll cover the following...

Cluster architecture and endpoints
High availability, backups, and PITR
- Automated failover mechanics
  - Continuous backups and point-in-time recovery
Secure connectivity and access control
- Network-level controls
- Authentication and encryption
Bulk loading data from Amazon S3
- Neptune Loader API
  - Prerequisites and workflow
Supported query languages overview
Conclusion

In the previous lesson, you explored graph database fundamentals: property graphs, RDF triples, vertices, edges, labels, and properties. Those concepts describe what a graph database stores. This lesson shifts focus to how Amazon Neptune runs those models as a production-grade, managed service on AWS. Neptune is not simply a query endpoint you point at. It is a clustered database engine with its own distributed storage layer, dedicated compute instances, automated failover, continuous backups, and fine-grained access controls. Understanding these operational mechanics is essential for designing resilient, secure, and performant graph workloads and for answering exam questions that test whether you can distinguish Neptune's architecture from relational or NoSQL patterns.

By the end of this lesson, you will understand Neptune's cluster architecture and endpoint routing, high availability and failover behavior, continuous backups and point-in-time recovery, secure VPC-based connectivity with optional IAM authentication, bulk loading from Amazon S3, and the three supported query languages. The next lesson dives into query syntax and modeling; here, the emphasis is on operational mechanics.

Cluster architecture and endpoints

Amazon Neptune organizes resources into a DB clusterA logical grouping that contains one primary (writer) instance, up to 15 read replica instances, and a shared distributed storage volume, all managed as a single unit.. This cluster model separates compute from storage, which means you can scale each tier independently based on workload demands.

Compute layer

The compute layer consists of database instances that process queries and manage connections. A Neptune cluster always has exactly one primary instance responsible for all write operations: inserts, updates, and deletes. You can add up to 15 read replicas that serve read queries concurrently. Each replica maintains its own in-memory page cache, so adding replicas improves read throughput roughly linearly for concurrent workloads. However, replicas do not increase write throughput because all mutations still flow through the single primary.

Instance classes determine the CPU and memory available to each instance. You can choose different instance classes for the primary and replicas, allowing cost optimization where read replicas use smaller instances for lighter traversal workloads while the primary handles heavier mutation processing. ...

1.Introduction

2.Common Foundation for All AWS Database Study

Cloud Lab

3.Amazon RDS

Cloud Lab

Cloud Lab

4.Amazon Aurora

Cloud Lab

5.Amazon DocumentDB

Cloud Lab

Cloud Lab

6.Amazon DynamoDB

Cloud Lab

Cloud Lab

7.Amazon ElastiCache

Cloud Lab

8.Amazon KeySpaces

Cloud Lab

9.Amazon MemoryDB

Cloud Lab

10.Amazon Neptune

Cloud Lab

11.Amazon Timestream

Cloud Lab

12.Conclusion

Neptune Database Fundamentals

Cluster architecture and endpoints

Compute layer