acmanjon - Sr. PSA at AWS & Flow Master

Ángel Conde Manjón

Senior Partner Solutions Architect — Data & AI @ AWS. I help partners build analytics platforms with Spark, Iceberg, EMR Serverless, and GenAI grounded in your data.

Madrid · Bilbao · Remote EMEA

About

Ph.D. in Computer Science. I specialize in building high‑performance, open data platforms on AWS: Iceberg‑based lakes, serverless Spark, streaming with Kafka, and production‑grade AI/ML foundations.

I collaborate with AWS Partners across EMEA—helping them design, benchmark, and take to market Data & AI solutions.

When I'm not at my desk, I spend most of my time outdoors. I do a lot of sports: mountain biking, surfing, hiking, running, and snowboarding.

Apart from sports, I enjoy watching TV series 🍿 and landscape photography.

Focus areas

  • Apache Spark
  • Apache Iceberg · S3 Tables
  • EMR Serverless · Glue
  • Kinesis · Kafka
  • Redshift · Athena
  • Machine Learning -> GenAI + your data

What I do best

Modern data platforms

Designing lakehouse architectures on S3 with robust governance.

High‑performance Spark

Optimizing Spark jobs, leveraging Gluten/Velox or other native execution engines, and solving data‑skew and shuffle bottlenecks.

AI with your data

Blueprints to differentiate GenAI apps by grounding them in enterprise data via AWS analytics and managed databases.

Experience

  1. Amazon Web Services — Sr. Partner Solutions Architect (Data & AI)

    Aug 2021 – Present

    Partner‑facing specialist for Data & AI across EMEA. Enable Iceberg‑based lakes, EMR Serverless, Glue, Redshift, Athena, and GenAI foundations.

  2. IKERLAN — Data Analytics & AI Team Lead

    May 2019 – Jul 2021

    Led the Data & AI team delivering industrial analytics and edge‑to‑cloud solutions with streaming and ML at scale.

  3. IKERLAN — DevOps & Data Engineer

    Sep 2015 – Jul 2019

    Built data ingestion and processing platforms; automation and CI/CD for analytics workloads.

  4. University of the Basque Country — Researcher

    Sep 2010 – Sep 2015

    Researcher in the Galan research group, working on Natural Language Processing & Big Data for smart tutoring systems.

Content

I like to write about technology on AWS blogs, LinkedIn or Medium. You can also see my public research publications and slides.

Detect & handle data skew on AWS Glue

Techniques and configurations to mitigate skew and improve Spark job reliability on Glue.

AWS Big Data Blog

Differentiate GenAI with your data

Blueprints to ground GenAI with enterprise data using AWS analytics and managed databases.

AWS Big Data Blog

Debezium & Kafka with Iceberg (Part II)

Moving CDC into a native AWS stack: Glue, Aurora, MSK Connect & S3 Tables.

Medium

Debezium & Kafka with Iceberg (Part I)

Build a modern data pipeline with Debezium for CDC, Kafka messaging, Iceberg tables and MinIO storage.

Medium

Using S3 Tables with Iceberg Java API

An introduction to the Iceberg REST catalog for S3 Tables, with examples of creating and manipulating tables via the Java API.

Medium

OpenSearch Neuronal plugin

Configure the ML-based neural search plugin to improve search, contrasting More Like This (MLT) and k‑NN approaches.

Medium

Open source projects

I contribute to and maintain open source projects around Spark, Iceberg, streaming and data engineering. Here are a few repositories:

Let’s build something impactful

Reach out for architecture reviews, enablement, or to collaborate on Data & AI content.