mikulskibartosz.name
Bartosz Mikulski
Leveraging AI to drive growth and innovation
All Stories
How to check whether a regular expression matches a string in Hive
What is the equivalent of Athena/Presto regexp_like in Hive
How to check whether a YARN application has finished
How to use Airflow PythonSensor to check whether a YARN application finished running
How to use WHEN CASE queries in AWS Athena
Using conditions in AWS Athena queries
How to decode base64 to text in AWS Athena
How to use from_base64 in AWS Athena
How to combine two DataFrames with no common columns in Apache Spark
Use full outer join to combine two Apache Spark DataFrames with no common columns
How to get names of columns with missing values in PySpark
How to get the names of missing properties for every row in a PySpark Dataframe
How to set a different retry delay for every task in an Airflow DAG
How to use a different retry delay in every Airflow task
How to find the Hive partition closest to a given date
How to use Airflow to find the Hive partition closest to a given date
Get the date of the previous successful DAG run in Airflow
Get the start time or the execution date of the previous successful DAG run in Airflow
How to prevent Airflow from backfilling old DAG runs
How to disable backfilling of an Airflow DAG or skip a part of the DAG during a backfill
What is s3:TestEvent, and why does it break my event processing?
S3 sends s3:TestEvent to SQS after setting up the bucket notifications
Making OFFSET LIMIT queries in AWS Athena
How to use OFFSET in AWS Athena queries