  • Warsaw, Poland

mgorsk1/README.md

Hi, I'm Mariusz✌🏻

~ whoami

I am an experienced Tech Lead within ING 🦁 Advanced Analytics, contributing to the Data Analytics Platform.

My areas of involvement over the last few years:

  • Platform, software and data engineering
  • Data cataloging, discoverability and lineage
  • Data ingestion frameworks
  • Data quality / profiling

🖥️ Technologies

I have strong experience in distributed systems, leveraging modern technologies such as:

  • Kubernetes
  • Apache Spark
  • Apache Airflow
  • Confluent Kafka
  • Elastic Stack

🏅 Certifications

I hold the following certifications:

  • Google Cloud Professional Cloud Architect (PCA)
  • Google Cloud Associate Cloud Engineer (ACE)
  • Certified Kubernetes Application Developer (CKAD)

🎓 Conducting trainings

I am also a trainer for Sages, a Polish training company, where I conduct Elastic Stack and Apache Spark training courses.

</> Open Source

My experience revolves mostly around Open Source technologies, of which I am very fond. I am proud to be a contributor to or maintainer of:

  • 🔍 OpenMetadata /contributor/ - an all-in-one platform for data discovery, data lineage, data quality, observability, governance, and team collaboration
  • 🔍 Amundsen (LF AI) /maintainer/ - a data discovery and metadata engine for improving the productivity of data analysts, data scientists and engineers when interacting with data
  • 🔍 Trino /contributor/ - a fast distributed SQL query engine for big data analytics that helps you explore your data universe
  • 🌬️ Apache Airflow /contributor/ - a platform to programmatically author, schedule and monitor workflows
  • 🌍 Apache Atlas /contributor/ - a metadata governance framework

Activity

📝 Medium Stories

I occasionally write stories on Medium:

My personal projects

My hobbies

  • ☕ Coffee
  • 🚴 Cycling
  • 🔴 Snooker

Find me elsewhere

Pinned

  1. garbage-detector-app Public

    Jupyter Notebook 3

  2. snooker Public

    snooker is a Python package providing a thin wrapper over a publicly available API for retrieving snooker statistics.

    Python 4 1

  3. brryle Public

    A simple search engine demonstrating full-text search capabilities of Elasticsearch.

    TypeScript 1

  4. amundsen Public

    Forked from amundsen-io/amundsen

    Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data – 🗃🕵️‍♀️

    Smarty

  5. pw-bigdata-project-python Public

    Jupyter Notebook

  6. pw-bigdata-project-scala Public

    Streaming

    Python