Question 1

How should I explain G1 vs ZGC in a Java system design interview?

Accepted Answer

Position G1 as the default server GC (since JDK 9), optimized for predictable pauses via region-based, incremental compaction, and ZGC as a low-latency GC that relocates objects concurrently and keeps pauses under 10 ms even on large heaps. Tie the choice to SLOs: G1 for balanced throughput and predictable pauses; ZGC when p95/p99 latency matters most on big heaps.

Question 2

What JVM memory model and tuning points should I cover?

Accepted Answer

Hit heap sizing (-Xms/-Xmx), young/old generations, metaspace, thread stacks, and safepoints. Mention GC logs/metrics, allocation rate, promotion failures, and how you iterate: measure → change one knob → re-measure under production-like load.
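
To make the "measure, change one knob, re-measure" loop concrete, here is a minimal sketch that reads heap usage and per-collector counts from the standard `java.lang.management` beans. The class name `JvmSnapshot` is hypothetical; in a real service you would more likely rely on GC logs or a metrics library, so treat this as an illustration only.

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryUsage;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: captures a point-in-time view of heap and GC activity.
final class JvmSnapshot {

    /** Returns the heap usage line first, then one line per registered collector. */
    static List<String> collect() {
        List<String> lines = new ArrayList<>();
        MemoryUsage heap = ManagementFactory.getMemoryMXBean().getHeapMemoryUsage();
        // getMax() can be -1 when the maximum is undefined.
        lines.add("heap used=" + heap.getUsed()
                + " committed=" + heap.getCommitted()
                + " max=" + heap.getMax());
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            // Count and accumulated time (ms) since JVM start; -1 means "not available".
            lines.add("gc " + gc.getName()
                    + " count=" + gc.getCollectionCount()
                    + " timeMs=" + gc.getCollectionTime());
        }
        return lines;
    }

    public static void main(String[] args) {
        collect().forEach(System.out::println);
    }
}
```

Taking one snapshot before and one after a load test gives a rough view of how many collections a change caused and how much time they cost.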

Question 3

How do I reason about throughput vs latency trade-offs on the JVM?

Accepted Answer

Explain that larger batches, fewer context switches, and G1 throughput tuning raise TPS but risk longer pauses, while ZGC, smaller batches, and tighter timeouts protect p95/p99 latency at some throughput cost. Always anchor to SLAs and real traffic shapes.

Question 4

How should I size Executors and thread pools in Java?

Accepted Answer

For CPU-bound work, start with about one thread per core (NCPU or NCPU + 1). For I/O-bound tasks, size by the ratio of wait time to compute time; consider virtual threads (Java 21) to simplify concurrency without ballooning platform threads. Cap queues, surface backpressure, and measure saturation.
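
As a rough illustration of sizing by blocking ratio, the sketch below uses the common heuristic threads ≈ cores × (1 + wait time / compute time) and builds a pool with a bounded queue that rejects work when full. The heuristic is an assumption brought in for the example (the answer above does not give a formula), and the names `PoolSizing`, `ioBoundThreads`, and `boundedPool` are hypothetical.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

final class PoolSizing {

    /** Heuristic starting point: cores * (1 + wait/compute), never below 1. */
    static int ioBoundThreads(int cores, double waitMillis, double computeMillis) {
        return (int) Math.max(1, Math.round(cores * (1 + waitMillis / computeMillis)));
    }

    /** Fixed-size pool with a capped queue; a full queue rejects instead of growing. */
    static ThreadPoolExecutor boundedPool(int threads, int queueCapacity) {
        return new ThreadPoolExecutor(
                threads, threads,
                0L, TimeUnit.MILLISECONDS,
                new ArrayBlockingQueue<>(queueCapacity),
                // AbortPolicy throws RejectedExecutionException, which callers
                // can translate into backpressure (for example an HTTP 503).
                new ThreadPoolExecutor.AbortPolicy());
    }

    public static void main(String[] args) {
        int cores = Runtime.getRuntime().availableProcessors();
        // Example: tasks that wait 90 ms on I/O for every 10 ms of CPU.
        System.out.println("suggested threads: " + ioBoundThreads(cores, 90, 10));
    }
}
```

The computed number is only a starting point; the measured saturation of the pool and queue under load should drive the final value.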

Question 5

When should I choose CompletableFuture over raw threads in Java?

Accepted Answer

Use CompletableFuture for async composition, timeouts, and non-blocking pipelines; fall back to plain threads for simple, bounded tasks. Prefer structured cancellation and combine with timeouts and bulkheads to avoid runaway fan-out.
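
A small sketch of async composition with a timeout, assuming two independent lookups that can run in parallel. `AsyncCompose`, `fetchProfile`, and the `"fallback"` value are hypothetical names chosen for the example; `thenCombine` and `completeOnTimeout` (Java 9+) are standard `CompletableFuture` methods.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

final class AsyncCompose {

    /**
     * Runs both lookups concurrently, combines the results, and falls back
     * to a default value if the combined result is not ready in time.
     */
    static String fetchProfile(Supplier<String> userLookup,
                               Supplier<String> ordersLookup,
                               long timeoutMillis) {
        CompletableFuture<String> user = CompletableFuture.supplyAsync(userLookup);
        CompletableFuture<String> orders = CompletableFuture.supplyAsync(ordersLookup);
        return user.thenCombine(orders, (u, o) -> u + ":" + o)
                .completeOnTimeout("fallback", timeoutMillis, TimeUnit.MILLISECONDS)
                .join();
    }

    public static void main(String[] args) {
        System.out.println(fetchProfile(() -> "u1", () -> "3 orders", 1000));
    }
}
```

Note that the timeout only completes the combined future; the underlying tasks keep running, which is one reason the answer recommends pairing timeouts with cancellation and bulkheads.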

Question 6

How do backpressure patterns work in Reactive Java (Project Reactor)?

Accepted Answer

Show demand signaling (request(n)), buffer/drop policies (onBackpressureBuffer, onBackpressureDrop, onBackpressureLatest), and thread boundaries (publishOn/subscribeOn). Emphasize measuring queue depth and using timeouts and retries so that slow subscribers cannot collapse the pipeline.
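
The demand-signaling idea can be sketched without Reactor by using the JDK's own `java.util.concurrent.Flow` interfaces, which follow the same Reactive Streams contract. This is a stand-in for illustration, not Reactor code: the subscriber below asks for one item at a time, and the small publisher buffer makes `submit` block when the subscriber falls behind. `DemandDemo` and `OneAtATime` are hypothetical names.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Flow;
import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.SubmissionPublisher;
import java.util.concurrent.TimeUnit;

final class DemandDemo {

    /** Subscriber that signals demand for exactly one item at a time. */
    static final class OneAtATime<T> implements Flow.Subscriber<T> {
        final List<T> received = new CopyOnWriteArrayList<>();
        final CountDownLatch done = new CountDownLatch(1);
        private Flow.Subscription subscription;

        @Override public void onSubscribe(Flow.Subscription s) {
            subscription = s;
            s.request(1); // initial demand
        }
        @Override public void onNext(T item) {
            received.add(item);
            subscription.request(1); // ask for the next item only when ready
        }
        @Override public void onError(Throwable t) { done.countDown(); }
        @Override public void onComplete() { done.countDown(); }
    }

    /** Publishes 0..count-1 and returns what the subscriber received. */
    static List<Integer> run(int count) {
        OneAtATime<Integer> sub = new OneAtATime<>();
        // Buffer of 4: submit() blocks the producer once the buffer is full.
        try (SubmissionPublisher<Integer> pub =
                     new SubmissionPublisher<>(ForkJoinPool.commonPool(), 4)) {
            pub.subscribe(sub);
            for (int i = 0; i < count; i++) {
                pub.submit(i);
            }
        } // close() signals onComplete after buffered items are delivered
        try {
            sub.done.await(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return sub.received;
    }

    public static void main(String[] args) {
        System.out.println(run(10));
    }
}
```

In Reactor the same demand flows through `Subscription.request(n)`, and operators such as `onBackpressureDrop` decide what happens when the producer outruns that demand.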

Question 7

What are Spring Boot microservice design best practices in Java?

Accepted Answer

Keep services small with clear boundaries, externalize config, use Actuator for health/metrics, validate input, and secure defaults. Add idempotency, retries, and circuit breakers; prefer immutable DTOs and contract tests.

Question 8

How do service discovery, config server, and API gateway trade off in Java?

Accepted Answer

Service discovery (Eureka/Consul/K8s DNS) removes hard-coded endpoints, a Config Server centralizes versioned config, and an API Gateway (Spring Cloud Gateway) handles routing, auth, and rate limits. Cost: more hops/operational complexity; benefit: control and safety.

Question 9

When should Java services use REST vs Kafka between services?

Accepted Answer

Use REST for synchronous request/response and user-facing actions; use Kafka for async workflows, decoupling, and retries at scale. Often you blend both: REST to accept the command, Kafka to orchestrate downstream side effects.

Question 10

How do I choose between SQL and NoSQL for Java services?

Accepted Answer

Pick SQL for strong consistency, joins, and transactions; pick NoSQL for high write throughput, flexible schemas, or large key-value/column workloads. Discuss data access patterns, sharding/secondary indexes, and operational maturity.

Question 11

What Redis caching patterns and invalidation strategies work in Java?

Accepted Answer

Use cache-aside for read-mostly data, read-through/write-through when the cache layer should own loading and writing so that cache and store stay in sync, and TTLs to limit staleness. Prevent stampedes with locks or request coalescing, and invalidate on authoritative writes or via pub/sub.
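
A minimal cache-aside sketch with a TTL and per-key coalescing, using an in-memory `ConcurrentHashMap` as a stand-in for Redis (an assumption made so the example stays self-contained; requires Java 16+ for the record). `CacheAside`, `Entry`, and the `loader` function are hypothetical names.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

final class CacheAside<K, V> {

    /** Cached value plus its absolute expiry time. */
    private record Entry<T>(T value, long expiresAtMillis) {}

    private final Map<K, Entry<V>> cache = new ConcurrentHashMap<>(); // stand-in for Redis
    private final Function<K, V> loader; // reads from the authoritative store
    private final long ttlMillis;

    CacheAside(Function<K, V> loader, long ttlMillis) {
        this.loader = loader;
        this.ttlMillis = ttlMillis;
    }

    /** Returns the cached value, loading it on a miss or after expiry. */
    V get(K key) {
        long now = System.currentTimeMillis();
        // compute() is atomic per key, so concurrent misses on the same key
        // trigger a single load instead of a stampede.
        return cache.compute(key, (k, e) ->
                (e != null && e.expiresAtMillis() > now)
                        ? e
                        : new Entry<V>(loader.apply(k), now + ttlMillis)
        ).value();
    }

    /** Call after an authoritative write so the next read reloads. */
    void invalidate(K key) {
        cache.remove(key);
    }
}
```

Usage might look like `new CacheAside<String, String>(id -> loadFromDb(id), 60_000)`, where `loadFromDb` is a hypothetical data-access call. Because `compute` holds a per-bin lock while the loader runs, this sketch suits short loads; with a real Redis you would typically use a lock key or a shared in-flight future instead.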

Question 12

How should I manage sessions in Java: stateless or sticky?

Accepted Answer

Prefer stateless (JWT or server-side token + shared store) for scale and resilience. Use sticky sessions only when legacy constraints demand it, and mitigate with a central session store and short lifetimes.

Question 13

How do I design token-bucket or leaky-bucket rate limiting in Java?

Accepted Answer

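Implement per-key buckets in Redis with a Lua script so that check-and-decrement is atomic across nodes, or use in-JVM buckets backed by shared state when limits must be fair across multiple nodes. Return Retry-After on rejection (typically with HTTP 429), enforce timeouts, and integrate with gateway policies.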
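
For the in-JVM case, a token bucket can be sketched in a few lines. This is a single-node illustration only (it does not address multi-node fairness), the clock is passed in explicitly to keep it testable, and `TokenBucket`, `tryAcquire`, and `retryAfterSeconds` are hypothetical names.

```java
final class TokenBucket {
    private final long capacity;
    private final double refillPerSecond;
    private double tokens;
    private long lastRefillNanos;

    /** Starts full; nowNanos is the caller's clock, e.g. System.nanoTime(). */
    TokenBucket(long capacity, double refillPerSecond, long nowNanos) {
        this.capacity = capacity;
        this.refillPerSecond = refillPerSecond;
        this.tokens = capacity;
        this.lastRefillNanos = nowNanos;
    }

    /** Takes one token if available; returns false when the caller should be rejected. */
    synchronized boolean tryAcquire(long nowNanos) {
        refill(nowNanos);
        if (tokens >= 1.0) {
            tokens -= 1.0;
            return true;
        }
        return false;
    }

    /** Whole seconds until one token is available, suitable for a Retry-After header. */
    synchronized long retryAfterSeconds(long nowNanos) {
        refill(nowNanos);
        if (tokens >= 1.0) {
            return 0;
        }
        return (long) Math.ceil((1.0 - tokens) / refillPerSecond);
    }

    /** Adds tokens for the elapsed time, capped at capacity. */
    private void refill(long nowNanos) {
        if (nowNanos > lastRefillNanos) {
            double added = (nowNanos - lastRefillNanos) / 1e9 * refillPerSecond;
            tokens = Math.min(capacity, tokens + added);
            lastRefillNanos = nowNanos;
        }
    }
}
```

A per-key limiter would keep one bucket per client in a `ConcurrentHashMap`; a leaky bucket differs mainly in draining at a fixed rate rather than allowing bursts up to the capacity.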

Question 14

What Kafka fundamentals should I mention in a Java interview?

Accepted Answer

Explain partitions, replication factor, consumer groups, per-partition ordering, and offset management. Tie choices to throughput, durability, and failure handling.

Question 15

How should I implement retries, timeouts, and circuit breakers in Java?

Accepted Answer

Use Resilience4j (or Spring Cloud Circuit Breaker) for timeouts, retry with jitter, bulkheads, and circuit breakers. Distinguish transient from permanent errors, cap retries, and emit metrics.
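
To show what "retry with jitter" and "cap retries" mean mechanically, here is a hand-rolled sketch in plain Java. It is not the Resilience4j API; in production the library's retry module would normally replace it. `RetrySketch`, `backoffMillis`, and `call` are hypothetical names, and the "full jitter" strategy (a uniform delay between 0 and the capped exponential backoff) is one common choice among several.

```java
import java.util.concurrent.ThreadLocalRandom;
import java.util.function.Predicate;
import java.util.function.Supplier;

final class RetrySketch {

    /** Full jitter: uniform delay in [0, min(cap, base * 2^attempt)]. */
    static long backoffMillis(int attempt, long baseMillis, long capMillis) {
        // Shift is clamped so small base values cannot overflow.
        long exponential = baseMillis << Math.min(attempt, 20);
        long ceiling = Math.min(capMillis, exponential);
        return ThreadLocalRandom.current().nextLong(ceiling + 1);
    }

    /** Retries transient failures up to maxAttempts; permanent failures propagate at once. */
    static <T> T call(Supplier<T> op, int maxAttempts, long baseMillis, long capMillis,
                      Predicate<RuntimeException> isTransient) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        RuntimeException last = null;
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            try {
                return op.get();
            } catch (RuntimeException e) {
                if (!isTransient.test(e)) {
                    throw e; // permanent error: do not retry
                }
                last = e;
            }
            if (attempt + 1 < maxAttempts) {
                try {
                    Thread.sleep(backoffMillis(attempt, baseMillis, capMillis));
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    break; // stop retrying when interrupted
                }
            }
        }
        throw last; // retries exhausted
    }
}
```

A circuit breaker would sit around this call and stop invoking `op` entirely once the recent failure rate crosses a threshold.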

Question 16

How do I set up Micrometer, Prometheus, and Grafana in Java?

Accepted Answer

Instrument with Micrometer counters, gauges, and timers (with histograms enabled where you need percentiles); expose /actuator/prometheus; scrape with Prometheus and visualize in Grafana. Track p50/p95/p99 latency and error rates per endpoint.

Question 17

How do I use OpenTelemetry for distributed tracing in Java?

Accepted Answer

Propagate W3C traceparent context, instrument HTTP and Kafka clients, export via OTLP, and sample smartly (tail or dynamic) to keep overhead low. Correlate traces with logs/metrics via trace IDs.
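
For reference, the W3C `traceparent` header has the shape `version-traceid-parentid-flags`. The sketch below parses version `00` by hand to show what is being propagated; in practice the OpenTelemetry SDK's propagators do this for you, so `TraceParent` and `Context` are hypothetical names for illustration (Java 16+ for the record).

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class TraceParent {

    // version 00, 32 hex chars of trace-id, 16 hex chars of parent-id, 2 hex chars of flags
    private static final Pattern FORMAT =
            Pattern.compile("^00-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})$");

    record Context(String traceId, String parentId, boolean sampled) {}

    /** Returns the parsed context, or empty when the header is missing or malformed. */
    static Optional<Context> parse(String header) {
        if (header == null) {
            return Optional.empty();
        }
        Matcher m = FORMAT.matcher(header.trim());
        if (!m.matches()) {
            return Optional.empty();
        }
        String traceId = m.group(1);
        String parentId = m.group(2);
        // All-zero ids are invalid per the specification.
        if (traceId.equals("0".repeat(32)) || parentId.equals("0".repeat(16))) {
            return Optional.empty();
        }
        // The lowest flag bit marks the trace as sampled.
        boolean sampled = (Integer.parseInt(m.group(3), 16) & 0x01) != 0;
        return Optional.of(new Context(traceId, parentId, sampled));
    }
}
```

Logging the `traceId` from this context alongside each log line is what makes the trace-to-log correlation mentioned above possible.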

Question 18

How should I version REST APIs in Java?

Accepted Answer

Prefer backward-compatible changes; when breaking, use URI or header versioning and deprecate gradually. Provide contract tests and consumer-driven contracts to avoid surprises.

Question 19

When should I use JSON vs Protobuf/gRPC in a Java stack?

Accepted Answer

Use JSON/REST for web and ecosystem reach; use gRPC/Protobuf for low-latency, strongly typed, internal RPCs and streaming. Many stacks expose REST externally and gRPC internally.

Question 20

How do I approach capacity planning for QPS and p95/p99 latency in Java services?

Accepted Answer

Measure single-instance throughput, model CPU/heap/GC impact, project concurrency, and add safety margins. Validate with load tests, set SLOs, and add admission control and load shedding before saturation.
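
The projection step can be illustrated with back-of-the-envelope arithmetic. The sketch assumes Little's law (in-flight requests ≈ arrival rate × mean latency) and a target utilization as the safety margin; both are assumptions added for the example, and `CapacityPlan` and its method names are hypothetical.

```java
final class CapacityPlan {

    /** Little's law: average concurrent requests = QPS * mean latency in seconds. */
    static double inFlight(double qps, double meanLatencySeconds) {
        return qps * meanLatencySeconds;
    }

    /** Instances needed so each one runs at or below the target utilization at peak. */
    static int instancesNeeded(double peakQps, double perInstanceQps, double targetUtilization) {
        return (int) Math.ceil(peakQps / (perInstanceQps * targetUtilization));
    }

    public static void main(String[] args) {
        // Example figures only: 10,000 QPS peak, 1,200 QPS measured per instance,
        // planning for 60% utilization to leave headroom for spikes and GC.
        System.out.println("instances: " + instancesNeeded(10_000, 1_200, 0.6));
        System.out.println("in-flight at 2,000 QPS and 50 ms: " + inFlight(2_000, 0.05));
    }
}
```

The in-flight figure feeds directly into thread pool, connection pool, and queue sizes; load tests then confirm whether p95/p99 stay within the SLO at the planned utilization.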

Question 21

When should I apply Strategy, Factory, or Template patterns in Java?

Accepted Answer

Use Strategy to swap algorithms at runtime, Factory to centralize creation and hide complexity, and Template Method to define skeletal workflows. Combine with Spring DI for testability and clear seams.
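
A compact sketch of Strategy combined with a simple factory method, using a hypothetical discount example (`Pricing`, `DiscountStrategy`, and the tier names are invented for illustration). Amounts are in cents to avoid floating-point rounding.

```java
final class Pricing {

    /** Strategy: one interchangeable pricing rule. */
    interface DiscountStrategy {
        long apply(long cents);
    }

    static final DiscountStrategy NONE = cents -> cents;
    static final DiscountStrategy SILVER = cents -> cents - cents / 10; // 10% off
    static final DiscountStrategy GOLD = cents -> cents - cents / 5;    // 20% off

    /** Factory: centralizes the choice so callers never name a concrete rule. */
    static DiscountStrategy forTier(String tier) {
        switch (tier) {
            case "gold":
                return GOLD;
            case "silver":
                return SILVER;
            default:
                return NONE;
        }
    }

    /** The caller depends only on the interface, so rules can be swapped at runtime. */
    static long checkout(long cents, DiscountStrategy strategy) {
        return strategy.apply(cents);
    }

    public static void main(String[] args) {
        System.out.println(checkout(1000, forTier("gold")));
    }
}
```

With Spring DI, each strategy would typically be a bean and the factory a small component that selects among injected implementations, which keeps each rule testable in isolation.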

Java System Design Interview Questions

Frequently Asked Questions