Data Platform archives
59 posts
Accelerating data science with Apache Spark and GPUs
By Giulia Lanzafame, 26 June 2025
Apache Spark has long been known for distributing computation across multiple nodes using partitions, and CPU cores have always...
Apache Spark security: start with a solid foundation
By Giulia Lanzafame, 10 June 2025
Everyone agrees security matters – yet when it comes to big data analytics with Apache Spark, it’s not just another checkbox. Spark’s open source Java...
What is geopatriation?
By Matthew de Klerk, 20 May 2025
Geopatriation refers to the relocation of workloads and applications from global cloud hyperscalers to regional or national alternatives due to geopolitical...
Building an end-to-end Retrieval-Augmented Generation (RAG) workflow
By Michelle Anne Tabirao, 15 May 2025
In this guide, we will take you through setting up a RAG pipeline. We will utilize open source tools such as Charmed OpenSearch for efficient search retrieval...
How does MongoDB work?
By Michelle Anne Tabirao, 2 April 2025
Explore what MongoDB is, how it functions, and how organizations utilize it for specific applications to achieve business benefits.
Valkey container image – securely designed, compliant, and long term supported (LTS)
By Michelle Anne Tabirao, 25 February 2025
Canonical has published best-in-class OCI-compliant Valkey container images that feature enhanced security. These images are based on Ubuntu LTS packages,...
Building RAG with enterprise open source AI infrastructure
By Michelle Anne Tabirao, 20 December 2024
How do you create a robust enterprise AI infrastructure for RAG systems using open source tooling? A highlight on how open source can help.
Get Valkey security patching and support with Ubuntu Pro
By Michelle Anne Tabirao, 16 December 2024
Canonical is pleased to announce security patching and support for Valkey through the Ubuntu Pro subscription. Ubuntu Pro is a subscription service for...
How does OpenSearch work?
By Michelle Anne Tabirao, 13 December 2024
How does OpenSearch work? OpenSearch is an open-source search and analytics suite. Developers build solutions for search and more!
What is RAG?
By Michelle Anne Tabirao, 13 December 2024
RAG explained: a technique that enhances generative AI models by utilizing external knowledge sources such as documents and extensive databases.
Spark or Hadoop: the best choice for big data teams?
By Giulia Lanzafame, 10 December 2024
I always find the Olympics to be an unusual experience. I’m hardly an athletics fanatic, yet I can’t help but get swept up in the spirit of the competition....
Apache Spark 4.0 beta release – try it now
By robgibbon, 15 October 2024
Apache Spark is a popular framework for developing distributed, parallel data processing applications. Our solution for Apache Spark on Kubernetes has made...