Skip to main content

Your submission was sent successfully! Close

Thank you for signing up for our newsletter!
In these regular emails you will find the latest updates from Canonical and upcoming events where you can meet our team.Close

Thank you for contacting us. A member of our team will be in touch shortly. Close

Charmed Apache Spark Solution Tutorial

Charmed Apache Spark provides utility client applications and additional components for seamless deployment on Kubernetes. For a detailed overview of its components, see the Components overview page.

Prerequisites

While this tutorial intends to guide you as you deploy Charmed Apache Spark for the first time, it will be most beneficial if you have:

  • Experience using a Linux-based CLI
  • General familiarity with Kubernetes commands and concepts (e.g. kubectl command)
  • Familiarity with Apache Spark commands and concepts
  • A computer that meets the Minimum system requirements from the Environment setup page

Step-by-step learning experience

This tutorial is divided into multiple steps, which we recommend following in the specified order:

Step Description
1. Environment setup Prepare your tutorial environment using a Multipass VM and deploy Apache Spark with the spark-client snap.
2. Distributed data processing Explore how to process large datasets efficiently across multiple nodes.
3. Data stream processing Learn how to handle real-time data streams.
4. History server Integrate with Apache Spark History Server.
5. Monitoring with COS Set up monitoring and alerting using Canonical’s Observability Stack (COS).
6. Wrapping up Finish the tutorial by decommissioning your Charmed Apache Spark environment to free up system resources.

Last updated a day ago. Help improve this document in the forum.