mikulskibartosz.name
Start here
About me
Twitter
Mastodon
Hire me
Bartosz Mikulski
Leveraging AI to drive growth and innovation
All Stories
What is the difference between a transformation and an action in Apache Spark?
What is an action in Apache Spark? What do you understand as transformations in Apache Spark?
Use LatestOnlyOperator to skip some tasks while running a backfill in Airflow
How to skip some tasks when backfilling a DAG in the past
Christopher Bergh - How the DataOps principles help data engineers make data pipelines trustworthy
An interview with Christopher Bergh who explains how the DataOps principles help data engineers make data pipelines trustworthy
How to retrieve the statuses of the recent DAG executions from Airflow database
How to make a dashboard that displays Airflow DAG statuses
How to find and terminate an idle Redshift session
How to find the idle session that is blocking the connection pool in Redshift
How to configure Spark to maximize resource usage while using AWS EMR
How to configure EMR to use all available resources when running a Spark cluster
How to use AWSAthenaOperator in Airflow to verify that a DAG finished successfully
How to check that an AWS Athena table contains data after running an Airflow DAG.
How to start an AWS Glue Crawler to refresh Athena tables using boto3
How to create and start an AWS Glue Crawler from Python code using boto3
How to retrieve the table descriptions from Glue Data Catalog using boto3
How to get the comments from the create table statements when the metadata is stored in the Glue Data Catalog
How to perform a batch write to DynamoDB using boto3
How to write multiple DynamoDB objects at once using boto3
How to populate a PostgreSQL (RDS) database with data from CSV files stored in AWS S3
How to upload S3 data into RDS tables
How to concatenate multiple MySQL rows into a single field?
How to concatenate multiple rows into a string in MySQL
« Prev
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
Next »