OrangePomeranian/README.md


Hi, I'm a Data Engineer with a strong interest in bioinformatics. I hold a Master’s degree in Bioinformatics and combine technical engineering expertise with a solid scientific background. In my current role, I design, implement, and optimize complex database models to support business analytics and data-driven decision-making. I build scalable data pipelines and maintain efficient data flows using Databricks, GitLab, SQL, and cloud-based technologies.

I remain closely connected to bioinformatics and NGS analysis, and I enjoy bringing together data engineering and biological insight to create meaningful, production-ready solutions.

I’m a great fan of foreign languages (currently working on my German 🇩🇪 and Mandarin 🇨🇳), cuisines 🥟, and cultures. I love reading literature in its original language 📖 and listening to English and German music (Linkin Park and Rammstein fan here 🎸), but I'm still open to new foreign experiences 🤩.


Key Skills

  • Programming Languages & Tools: Python, R, SQL, Linux, Command Line, Git, GitLab, Docker, Snakemake, Nextflow, Databricks
  • Data Engineering: Building scalable data pipelines, designing complex database models, ETL workflows, cloud-based data processing, version control and CI/CD with GitLab
  • Bioinformatics: Experience in NGS data analysis, RNA-seq pipelines, biostatistics, and working with biological databases such as NCBI and GenBank; familiar with the Galaxy platform

Languages

  • Dutch: Limited working proficiency
  • English: Full professional proficiency
  • German: Limited working proficiency
  • Polish: Native

Pinned

  1. LIMS_database Public

    Medical database project with user login and the ability to add new data to the database

    PHP

  2. Analiza_Danych_projekt Public

    Finding mutations in genomic data using the chi-squared test and parallel functions in Python and R

    R 1

  3. Bachelor_thesis Public

    The performance of individual CNV detection tools and state-of-the-art sequencing. All analyses were performed in Python and R.

    Python 1

  4. Master_thesis Public

    Analysis of the SNPs that have the greatest impact on the occurrence of mammary gland tumours in dogs.

    R 1

  5. Pipeline_with_Snakemake Public

    Genomic Analysis Pipeline: Automate data preprocessing, variant calling, and annotation with Snakemake. Ensure reproducibility and reliability in genomic studies.

    HTML

  6. ChiP-seq Public

    This project explores the PBRM1-PIAS1 interaction in epithelial differentiation through ChIP-seq analysis, highlighting EZH2's role and implications for cholesterol biosynthesis in cellular processes.

    Jupyter Notebook