Content
I like to write about technology on AWS blogs, LinkedIn or Medium. You can also see my public research publications and slides.
Detect & handle data skew on AWS Glue
Techniques and configurations to mitigate skew and improve Spark job reliability on Glue.
AWS Big Data BlogDifferentiate GenAI with your data
Blueprints to ground GenAI with enterprise data using AWS analytics and managed databases.
AWS Big Data BlogDebezium & Kafka with Iceberg (Part II)
Moving CDC into a native AWS stack: Glue, Aurora, MSK Connect & S3 Tables.
MediumDebezium & Kafka with Iceberg (Part I)
Build a modern data pipeline with Debezium for CDC, Kafka messaging, Iceberg tables and MinIO storage.
MediumUsing S3 Tables with Iceberg Java API
An introduction to the Iceberg REST catalog for S3 tables and examples of creating and manipulating tables via the Java API.
MediumOpenSearch Neuronal plugin
Configure the ML-based Neuronal plugin to improve search, contrasting MLT and k‑NN approaches.
Medium