Data Platform archives

59 posts

Accelerating data science with Apache Spark and GPUs

By Giulia Lanzafame, 26 June 2025

Apache Spark has always been very well known for distributing computation among multiple nodes using the assistance of partitions, and CPU cores have always...

Apache Spark security: start with a solid foundation

By Giulia Lanzafame, 10 June 2025

Everyone agrees security matters – yet when it comes to big data analytics with Apache Spark, it’s not just another checkbox. Spark’s open source Java...

What is geopatriation?

By Matthew de Klerk, 20 May 2025

Geopatriation refers to the relocation of workloads and applications from global cloud hyperscalers to regional or national alternatives due to geopolitical...

Building an end-to-end Retrieval- Augmented Generation (RAG) workflow 

By Michelle Anne Tabirao, 15 May 2025

In this guide, we will take you through setting up a RAG pipeline. We will utilize open source tools such as Charmed OpenSearch for efficient search retrieval...

How does MongoDB work?

By Michelle Anne Tabirao, 2 April 2025

Explore what MongoDB is, how it functions, and how organizations utilize it for specific applications to achieve business benefits.

Valkey container image – securely designed, compliant, and long term supported (LTS)

By Michelle Anne Tabirao, 25 February 2025

Canonical has published best-in-class OCI-compliant Valkey container images that feature enhanced security. These images are based on Ubuntu LTS packages,...

Building RAG with enterprise open source AI infrastructure

By Michelle Anne Tabirao, 20 December 2024

How to create a robust enterprise AI infrastructure for RAG systems using open source tooling?A highlight on how open source can help

Get Valkey security patching and support with Ubuntu Pro

By Michelle Anne Tabirao, 16 December 2024

Canonical is pleased to announce security patching and support for Valkey through the Ubuntu Pro subscription. Ubuntu Pro is a subscription service for...

How does OpenSearch work?

By Michelle Anne Tabirao, 13 December 2024

How does opensearch work? OpenSearch is an open-source search and analytics suite. Developers build solutions for search and more!

What is RAG?

By Michelle Anne Tabirao, 13 December 2024

RAG explained: is a technique that enhances generative AI models by utilizing external knowledge sources such as documents and extensive databases.

Spark or Hadoop: the best choice for big data teams?

By Giulia Lanzafame, 10 December 2024

I always find the Olympics to be an unusual experience. I’m hardly an athletics fanatic, yet I can’t help but get swept up in the spirit of the competition....

Apache Spark 4.0 beta release – try it now

By robgibbon, 15 October 2024

Apache Spark is a popular framework for developing distributed, parallel data processing applications. Our solution for Apache Spark on Kubernetes has made...

  1. Previous page
  2. 1
  3. 2
  4. 3
  5. 4
  6. 5
  7. Next page