Hajira Jabeen

Hajira Jabeen

Team Lead · AI for Research Data Management (AI4RDM)
Institute of Biomedical Informatics (BI-K) · University Hospital Cologne
1,600+ Citations
h20 h-index
11 PhD Students
€1.4M+ Funds Acquired

About

I am a senior researcher at the Institute of Biomedical Informatics (BI-K), University Hospital Cologne, where I work on the application of Artificial Intelligence for data-intensive problems in healthcare. My research focuses on data science and data management at scale, with particular emphasis on robust, reproducible, and FAIR-compliant handling of biomedical data. I develop scalable AI models and algorithms for data analytics and data stewardship, aiming to turn complex, heterogeneous data into reliable and actionable knowledge. Methodologically, my work integrates Knowledge Graphs, Natural Language Processing, and FAIR data principles, with a strong focus on practical applicability in clinical and research settings.

Previously, I was Team Leader for Big Data Analytics at the GESIS – Leibniz Institute for the Social Sciences, where I worked on large-scale data analytics and the development of the Methods Hub, focusing on reusable, scalable analytical workflows and tools for the social sciences. I also served as Group Leader for Distributed Semantic Analytics at the University of Bonn within the Smart Data Analytics (SDA) lab, and as a Data Science Expert at the University of Cologne within the CEPLAS cluster of excellence.

My research spans distributed analytics, data mining, semantic web technologies, and data FAIRification. I have contributed to multiple Horizon 2020–funded projects, designing and implementing scalable data and analytics architectures across domains including maritime systems, energy, food systems, social sciences, smart cities, and plant sciences. I hold a B.Ed. in teaching and have 10+ years of university-level teaching experience across BSc, MSc, and PhD programmes.

Email: hajira.jabeen[at]uk-koeln.de

Key Research Areas

Knowledge Graphs NLP / LLMs FAIR Data Data Engineering Semantic Web Ontology Engineering Big Data / Apache Spark Data Stewardship AI in Healthcare Graph Embeddings Data Governance Reproducible Science

Languages

English (Proficient) German (B2) Urdu (Native) Punjabi (Native)

Education

  • PhD (Summa cum laude, 4.0/4.0) – Classifier Generation using Genetic Programming, 2010
  • M.S. Computer Science (3.33/4.0), 2006
  • M.Sc. Computer Science (3.27/4.0), 2001
  • B.Sc. Mathematics, Grade A, 1999

Skills

Metadata & Data Governance

  • Metadata catalogue design & lifecycle
  • FAIR, DCAT, Dublin Core, schema.org
  • Controlled vocabularies & ontology design
  • Cross-domain metadata harmonization
  • GDPR-compliant data access

Semantic Technologies

  • Ontology engineering (OWL, RDFS)
  • Knowledge graph design & construction
  • RDF, SPARQL, semantic reasoning
  • Cross-lingual ontology matching

AI, NLP & Multi-Agent

  • LLMs, BERT, GLiNER, vLLM
  • Knowledge graph embeddings
  • LangChain for multi-agent systems
  • Information extraction & annotation

Data Engineering

  • ETL pipelines for heterogeneous data
  • PostgreSQL, MySQL, MongoDB, Cassandra
  • Apache Spark, Flink, Hadoop
  • Docker, cloud/hybrid infrastructures
  • REDCap, IRIS, data warehousing

Programming & MLOps

  • Python, PyTorch, TensorFlow, pySpark
  • Model fine-tuning & deployment
  • WANDB model tracking & versioning
  • Version control & collaborative dev

Leadership & Management

  • Cross-functional team leadership
  • FitSM / IT Service Management
  • Strategic planning & stakeholder comms
  • Grant acquisition & project management
  • Multinational collaboration

Publications

2020
Editors: V. Janev, D. Graux, H. Jabeen, E. Sallinger
Springer Nature. DOI: 10.1007/978-3-030-53199-7 [Open Access]
2024
Semantics for Culinary Health Care
H. Jabeen
In: Roles and Challenges of Semantic Intelligence in Healthcare Cognitive Computing (pp. 51–67). IOS Press.
2024
J.Z. Pan, S. Razniewski, …, H. Jabeen, …, D. Graux
Transactions on Graph Data and Knowledge (TGDK), 2024
2023
C.F. Draschner, H. Jabeen, J. Lehmann
International Journal of Semantic Computing, 17(02), 199–221, 2023
2023
M.S. Razzaq, F. Maqbool, M. Ilyas, H. Jabeen
IEEE Access [IF 3.36], 2023
2023
F. Maqbool, M. Fahad, M. Ilyas, H. Jabeen
Expert Systems [IF 2.81], 2023
2023
S. Ibrahim, S. Fathalla, J. Lehmann, H. Jabeen
IEEE Access, 11, 8581–8599 [IF 3.36], 2023
2022
TIER2: Enhancing Trust, Integrity and Efficiency in Research through Next-level Reproducibility
T. Ross-Hellauer, T. Klebel, A. Bannach-Brown, S. Horbach, H. Jabeen, N. Manola, et al.
Research Ideas and Outcomes, 8, e98457, 2022
2022
H. Allah, S. Fathalla, J. Lehmann, H. Jabeen
Enterprise Information Systems, 17(7), 2062683 [IF 4.3], 2022
2020
F.A. Musyaffa, M.-E. Vidal, F. Orlandi, J. Lehmann, H. Jabeen
Expert Systems with Applications, 147, 113135 [IF 4.2], 2020
2019
M. Ali, C.T. Hoyt, D. Domingo-Fernández, J. Lehmann, H. Jabeen
Bioinformatics, 35(18), 3538–3540 [IF 5.48], 2019
2019
P. Westphal, L. Bühmann, S. Bin, H. Jabeen, J. Lehmann
Semantic Web, 10(2), 231–245 [IF 2.55], 2019
2013
H. Jabeen, A.R. Baig
Neurocomputing, 116, 311–316 [IF 3.317], 2013
2012
H. Jabeen, A.R. Baig
Applied Soft Computing, 12(1), 416–422 [IF 3.907], 2012
2011
H. Jabeen, A.R. Baig
Computers in Human Behaviour, 27(5), 1475–1481 [IF 3.435], 2011
2011
H. Jabeen, A.R. Baig
International Journal of Innovative Computing, Information and Control, 8(1), 233–242 [IF 2.9], 2011
2026
A.A. Dayeh, H. Jabeen, O. Beyan
Medical Informatics Europe Conference (MIE), 2026
2026
N. Roqaya, H. Jabeen, T. Papenbrock
Intelligent Systems Conference (IntelliSys), 2026
2025
H. Jabeen
Research Conference on Metadata and Semantics Research (MTSR), 2025. Springer International Publishing.
2024
S. Linzbach, D. Dimitrov, L. Kallmeyer, K. Evang, S. Dietze, H. Jabeen
Proceedings of NAACL 2024: Human Language Technologies, Vol. 1 (Long Papers), pp. 3645–3655
2024
S. Gangopadhyay, S. Schellhammer, S. Hafid, D. Dessí, C. Koß, K. Todorov, H. Jabeen
Proceedings of the 35th ACM Conference on Hypertext and Social Media, pp. 246–258, 2024 [Best Paper Award Nomination]
2023
S. Linzbach, T. Tressel, L. Kallmeyer, S. Dietze, H. Jabeen
Companion Proceedings of the ACM Web Conference 2023, pp. 1145–1149
2023
S. Dietze, H. Jabeen, L. Kallmeyer, S. Linzbach
IEEE International Conference on Semantic Computing (ICSC), pp. 204–211, 2023
2023
F.B. Moghaddam, J. Lehmann, H. Jabeen
IEEE Sixth International Conference on AI and Knowledge Engineering (AIKE), 2023
2023
F.B. Moghaddam, J. Lehmann, H. Jabeen
IEEE International Conference on Semantic Computing (ICSC), pp. 204–211, 2023
2022
C.F. Draschner, J. Lehmann, H. Jabeen
IEEE International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), 2022
2022
F.B. Moghaddam, J. Lehmann, H. Jabeen
IEEE International Conference on Semantic Computing (ICSC), pp. 243–250, 2022
2021
C.F. Draschner, C. Stadler, F.B. Moghaddam, J. Lehmann, H. Jabeen
Proceedings of the 30th ACM International Conference on Information & Knowledge Management (CIKM), pp. 4465–4474, 2021
2021
F.B. Moghaddam, C.F. Draschner, J. Lehmann, H. Jabeen
International Conference on Semantic Systems (SEMANTiCS), 2021. IOS Press.
2021
C.F. Draschner, J. Lehmann, H. Jabeen
IEEE International Conference on Semantic Computing (ICSC), pp. 333–336, 2021
2021
F. Maqbool, S. Razzaq, A. Yar, H. Jabeen
IEEE Congress on Evolutionary Computation (CEC), pp. 2559–2566, 2021
2020
S. Ibrahim, S. Fathalla, J. Lehmann, H. Jabeen
IEEE/WIC/ACM International Joint Conference on Web Intelligence (WI-IAT), pp. 113–120, 2020
2020
H. Jabeen, E. Haziiev, G. Sejdiu, J. Lehmann
IEEE International Conference on Semantic Computing (ICSC), pp. 400–407, 2020
2020
H. Jabeen, J. Weinz, J. Lehmann
IEEE Congress on Evolutionary Computation (CEC), pp. 1–7, 2020
2019
M.N. Mami, D. Graux, S. Scerri, S. Auer, H. Jabeen, J. Lehmann
International Semantic Web Conference (ISWC), pp. 229–245, 2019. Springer.
2019
G. Sejdiu, A. Rula, J. Lehmann, H. Jabeen
International Semantic Web Conference (ISWC), pp. 261–276, 2019. Springer.
2019
M. Ali, H. Jabeen, C.T. Hoyt, J. Lehmann
International Semantic Web Conference (ISWC), pp. 3–18, 2019. Springer.
2019
H. Jabeen, N. Tahara, J. Lehmann
EvoMUSART 2019 (EvoStar), pp. 156–172. Springer.
2019
H. Jabeen, R. Dadwal, G. Sejdiu, J. Lehmann
European Knowledge Acquisition Workshop (EKAW), pp. 534–548. Springer, 2018.
2017
J. Lehmann, G. Sejdiu, L. Bühmann, P. Westphal, C. Stadler, I. Ermilov, …, H. Jabeen
International Semantic Web Conference (ISWC), pp. 147–155, 2017. Springer.
2009
H. Jabeen, Z. Jalil, A.R. Baig
Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation (GECCO), pp. 2047–2052, 2009
2020
H. Jabeen
In: Knowledge Graphs and Big Data Processing, Chapter 3, pp. 35–55. Springer, 2020.
2020
H. Jabeen, D. Graux, G. Sejdiu
In: Knowledge Graphs and Big Data Processing, Chapter 7, pp. 105–121. Springer, 2020.
2017
H. Jabeen, P. Archer, S. Scerri, A. Versteden, I. Ermilov, G. Mouchakis, J. Lehmann, S. Auer
EDBT/ICDT Workshops, 2017
2010
H. Jabeen, A.R. Baig
Studies in Computational Intelligence (SCI), Vol. 284. Springer, 2010.
2025
Click, Upload, Confuse? Usability Testing of Dataverse and FAIRDOM-SEEK
H. Jabeen
Data Stewardship goes Germany (DSgG), 2025
2025
Beyond Compliance: Human-Centered FAIR Data Tools & Management
H. Jabeen
Helmholtz Metadata Conference (HMC), 2025
2025
Data Stewards, the Architects of FAIR
H. Jabeen, S. Avila-Calero
Fellowship of the Data – International RDM Community Meeting, 2025
2019
M.N. Mami, D. Graux, S. Scerri, H. Jabeen, S. Auer, J. Lehmann
ISWC Posters and Demos, 2019
2019
M.N. Mami, D. Graux, S. Scerri, H. Jabeen, S. Auer
The World Wide Web Conference (WWW), 2019
2018
D. Graux, G. Sejdiu, H. Jabeen, J. Lehmann, D. Sui, D. Muhs, J. Pfeffer
SEMANTiCS Conference, 2018
2017
The Tale of Sansa Spark [Best Demo Award, ISWC 2017]
I. Ermilov, J. Lehmann, G. Sejdiu, L. Bühmann, P. Westphal, …, H. Jabeen
International Semantic Web Conference (ISWC), Posters, Demos & Industry Tracks, 2017
2023
D. Biswas, S. Linzbach, D. Dimitrov, H. Jabeen, S. Dietze
KBC-LM/LM-KBC @ ISWC, 2023
2023
vS. Gangopadhyay, K. Boland, D. Dessí, S. Dietze, P. Fafalios, A. Tchechmedjiev, …, H. Jabeen
D2R2 – 2nd International Workshop on Linked Data-Driven Resilience Research, 2023
2021
C.F. Draschner, F.B. Moghaddam, J. Lehmann, H. Jabeen
LAMBDA Big Data Analytics Doctoral Workshop, 2021
2021
F.B. Moghaddam, C.F. Draschner, J. Lehmann, H. Jabeen
LAMBDA Big Data Analytics Doctoral Workshop, 2021
2021
D. von Suchodoletz, T. Mühlhaus, D. Brilhaus, H. Jabeen, B. Usadel, J. Krüger, H. Gauza, C. Martins Rodrigues
E-Science Tage, 2021
2024
[PromptEng] First International Workshop on Prompt Engineering for Pre-Trained Language Models
D. Graux, S. Montella, H. Jabeen, C. Gardent, J.Z. Pan
Companion Proceedings of the ACM on Web Conference, pp. 1311–1312, 2024
2023
The FAIRy Tale of Genetic Algorithms
F. Maqbool, M.S. Razzaq, H. Jabeen
arXiv preprint arXiv:2305.00238, 2023
2022
T. Ross-Hellauer, T. Klebel, A. Bannach-Brown, S. Horbach, H. Jabeen, N. Manola, et al.
Research Ideas and Outcomes (RIO Journal), 2022

Experience

AI Expert & Data Scientist
Institute of Biomedical Informatics (BI-K) · University Hospital Cologne
01/2026 – present

Focusing on AI and ML methods applied to health data, developing scalable and reusable solutions for clinical use. Using LLMs (e.g., GLiNER, BERT) for de-identification, clinical code annotation, and data summarisation for clinicians; employing PostgreSQL and Grafana for data quality monitoring.

Key Areas:
  • Natural Language Processing, Data Engineering, Big Data, Information Extraction
Team Lead · AI for Health Data Management (AI4RDM)
University Hospital Cologne
10/2024 – 01/2026

Led AI-driven metadata and research data management solutions aligned with FAIR principles. Collaborated with NFDI4DS and NFDI4Health on metadata standards and cross-domain interoperability. Delivered workshops, tutorials, and webinars on RDM across Clinical Research Centres. Provided strategic and technical leadership for clinician-facing services including the FAIRdata-Cologne Dataverse catalogue, FAIRSpace Cologne, and REDCap.

Platforms Operated:
  • FAIRdata-Cologne (Dataverse-based metadata catalogue)
  • FAIRSpace Cologne – secure resource sharing
  • REDCap – GDPR-compliant data capture and exchange
01/2022 – 09/2024

Led design and development of metadata-aware analytics services for large-scale social science datasets. Applied knowledge graphs and semantic data modelling to enable FAIR-compliant data access, integration, and reuse. Collaborated with national data infrastructure initiatives including NFDI4DS and NFDI(BERD). Contributed to the Methods Hub for reproducible and explainable analytics methods.

Key Areas:
  • NLP, Data Engineering, Big Data, Quality & Compliance Management
  • FitSM / IT Service Management
Academic Data Scientist · FAIR Data Management
University of Cologne · CEPLAS Cluster of Excellence
05/2020 – 12/2021

Led FAIR Data Management initiatives at CEPLAS, designing and implementing a comprehensive FDM solution in collaboration with the DataPlant consortium. Coordinated with multiple NFDI consortia. Organised workshops on Research Data Management and supported development of data management plans.

Research Group Lead · Distributed Semantic Analytics
University of Bonn · Smart Data Analytics Lab
2016 – 2020

Head of the "Distributed Semantic Analytics" research group. Oversaw research in distributed analytics, knowledge graphs, and machine learning. Secured and managed multiple research grants; led teaching and organisational responsibilities.

Projects led:
  • Co-PI: Bio2Vec (Smart Data Analytics for Life Sciences)
  • Co-PI: HSP – Smoothed Analysis of Machine Learning Algorithms
  • Technical Lead: H2020 Big Data Ocean
  • Technical Lead: H2020 Big Data Europe ("Big Data Integrator Platform")
  • Knowledge Transfer Expert: Gradana, Lambda, Cleopatra
Postdoctoral Researcher
Leipzig University
2015 – 2016

Work package lead on the Horizon 2020–funded Big Data Europe project, developing a multi-purpose, open-source, and scalable platform for European research communities. Research in Description Logics, Structured Machine Learning, and Semantic Web with Apache Spark, Flink, and Docker.

Assistant Lecturer & Researcher
IT University of Copenhagen
2014 – 2015

Teaching (Software Architecture, Data Mining) and active membership in the GameAI and REAL research groups. Work on Monte Carlo Tree Search, procedural game development, and evolutionary algorithms.

Software Engineer
TEO Intl A/S, Copenhagen
2013 – 2014

Worked in a team project on an accelerated global team-building software solution.

Head of Department & Assistant Professor
IQRA University, Islamabad
2009 – 2013

Head of the Computing and Technology department (20+ staff, ~2,000 students). Led departmental accreditation, curriculum development, and hiring. Taught undergraduate, graduate, and PhD courses with consistently outstanding evaluations. First female PhD graduate from NU-FAST, 2010.

Funded Research Projects

Technical Lead · H2020
Development of the "Big Data Integrator Platform" for handling large volumes of heterogeneous data. Demonstrates use cases across seven EU societal challenges: Climate, Energy, Food, Health, Transport, Security, and Social Sciences.
2015 – 2018
Technical Lead · H2020
Technical lead for data harmonisation and analytics platform for maritime data.
2016 – 2019
Co-PI · CRG/KAUST
Smart data analytics for life sciences; knowledge graph embeddings for biological data.
2017 – 2020
Co-PI · H2020
Digital Platform and Analytic Tools for Energy (DIgital PLAtform and analytic TOOls for eNergy).
2020 – 2022
Academic Expert · H2020
Learning, Applying, Multiplying Big Data Analytics. Knowledge exchange and education capacity-building for the West Balkans region.
2018 – 2020
Academic Expert · H2020
Cross-lingual event-centric open analytics research academy.
2019 – 2022
Academic Expert · H2020
Knowledge exchange and educational capacity building in Big Data analytics.
2017 – 2019
ML Expert
Technical advisory for platform development and multimodal data analytics for open fiscal data.
2016 – 2018
ML Consultant · H2020
Big data intermediate service layer for user-centric services and tools covering the full data lifecycle for Earth Observation (Copernicus) data.
2019 – 2021
ML Consultant · H2020
Scalable Linked Data Integration and Exploitation of Points-of-Interest data. RDF transformation, interlinking, enrichment, fusion, and quality assessment.
2017 – 2020
Co-PI · EU Horizon
Enhancing Trust, Integrity and Efficiency in Research through next-level Reproducibility.
2022 –
Smoothed Analysis of ML Algorithms
Co-PI · Hochschulpakt (HSP)
Theoretical analysis and practical improvements in the smoothed complexity of machine learning algorithms.
2016 – 2018

Funding Acquired

ProjectSourceAmount
TIER2EU Horizon€162,500
PLATOONEU H2020€507,500
LAMBDAEU H2020€181,718
CleopatraEU H2020€377,989
Bio2VecCRG / KAUST€80,000
Smoothed Analysis of ML AlgorithmsHochschulpakt (HSP)€70,000
CSCUBS ConferenceInternal€21,000
Total€1,400,706

Supervision

PhD Students — Completed

  • 2025Cross-Lingual Ontology Matching and Enrichment — Shimaa Khaled Shaker Ibrahim, University of Bonn
  • 2025Distributed Decision Models for Structured Data — Hebaallah Ibrahim Abdelrehim Mohamed, University of Bonn
  • 2024Distributed Anomaly Detection — F. Moghadam Bakhshandegan, University of Bonn
  • 2024Intelligent Cognitive Systems for Automated Recipe Generation — Muhammad Saad Razzaq, University of Sargodha
  • 2024Scalability and Fairification of Evolutionary Algorithms — Fahad Maqbool, University of Sargodha
  • 2023Distributed Machine Learning for Knowledge Graphs — Carsten Drachner, University of Bonn
  • 2020Scalable Multilingual and Heterogeneous Fiscal Data Integration — Fathoni Musyafa, University of Bonn
  • 2020Distributed RDF Processing using Apache Spark — Gezim Sejdiu, University of Bonn
  • 2016Performance Analysis of Evolutionary Computation for Continuous Optimization Problems — Qamar Abbas, Iqra University

PhD Students — In Progress

  • ongoingIn ProgressMedical Data Annotations using Foundation Models — Mehrshad Jaberansary, University of Cologne
  • ongoingIn ProgressData Harmonization in Medical Records using Knowledge Graphs and Foundation Models — Ahmad Abu Dayeh, University of Cologne
  • ongoingIn ProgressLLMs as Patient Information Tools in ADPKD — Sarah Vilayil, University of Cologne
  • ongoingIn ProgressContext-Aware Learning in Large Language Models — Stephan Linzbach, GESIS
  • ongoingIn ProgressLarge-Scale Entity Linking for Claims Retrieval — Susmita Gangopathi, GESIS
  • ongoingIn ProgressScalable Distributed Terminological Decision Trees for RDF — Heba Allah, University of Bonn
  • ongoingIn ProgressDistributed Knowledge Fusion in Knowledge Graphs — Shimaa Khalid, University of Bonn

Master Theses

  • 2023Analysis of Syntax on Fact Retrieval from Large Language Models — Tim Tressel
  • 2021Ontology-Based Modelling of Food Recipe Concepts and Processes — Hammad Malik
  • 2021Ontology Webform: Generating Web Forms using Ontologies — Kunal Rout
  • 2020Indexed Negative Sampling and Scalable Knowledge Graph Embeddings — Tasneem Tazeen Rashid
  • 2020Learning Defects from Aircraft Non-Destructive Testing (NDT) Data — Navya Prakash (DLR, Cologne)
  • 2020Evolution and Generation of Cooking Recipes — Jonas Weinz
  • 2019Smart Chef – Evolving Recipes — Carsten Felix Draschner
  • 2019Automated Link Discovery for Data Harmonisation in the Maritime Domain — Jaime Manuel Trillos
  • 2019Scalable Entity Resolution — Amrit Kaur
  • 2019Distributed In-Memory SPARQL Processing — Haziiev Eskender
  • 2019Scalable RDF Clustering — Pratik Kumar Agarwal
  • 2018Rule Mining on Distributed RDF Data — Kunal Jha
  • 2018Machine-Generated Recipes using Evolutionary Algorithms — Nargis Tahara
  • 2018Distributed RDF Clustering Framework — Tina Boroukhian
  • 2018Distributed Data Parsing and Vandalism Detection on Large-Scale Knowledge Graphs — Nayef Fayez Roqaya
  • 2018Scalable Numerical Outlier Detection in Knowledge Graphs — Rajjat Dadwal
  • 2017Association Rule Mining of Linked Data using Apache Spark — Theresa Nathan
  • 2017Reserving Algorithms and Portfolio Analysis for Life Insurance — Amir Ansari (SCOR, Cologne)
  • 2016Extraction and Fusion of Identity Data — Yujie Diao (SMS Group GmbH)
  • 2015Developing a Believable MCTS Agent in a Real-Time Platform Shooter — Lasse Knudsen, Lasse Joergensen
  • 2014Rule Discovery using Differential Evolution for Breast Cancer Survival — Zubair Farooq
  • 2013Comprehensive Rules Discovery using PSO for Back Pain Diagnosis — Hasan Kamal
  • 2013A Modified Decreasing Inertia Weight in PSO — Muhammad Husnain
  • 2010Modified PSO with Laplace Mutation Operator (LMPSO) — Muhammad Imran

Teaching

Special Schools & Tutorials

  • Research Data ManagementCEPLAS, University of Cologne
  • Big Data in PracticeGradana Summer School (2019)
  • Big Data ArchitectureGradana Summer School (2019)
  • Smart Citizen ServicesGradana Summer School (2020)
  • Distributed Big Data FrameworksLAMBDA Summer School (2019, 2020)
  • Semantic-Aware AnalyticsTutorial at ESWC (2019), ISWC (2020)
  • Knowledge GraphsCleopatra Summer School

PhD Level

  • Theoretical Computational IntelligenceIqra University, 2011–12
  • Advanced Analytics / Data ScienceIqra University, 2012–13

Master Level

  • AI in MedicineUniversity of Cologne, 2025–26
  • LLM Journal ClubUniversity of Cologne, 2025–26
  • Deep Learning in MedicineUniversity of Cologne, 2025–26
  • Big Data AnalyticsUniversity of Bonn, 2017–2019
  • Knowledge Graph AnalyticsUniversity of Bonn, 2017–2018
  • Data MiningIT University of Copenhagen, 2015
  • Advanced Data MiningIqra University, 2011–12
  • Advanced Artificial IntelligenceIqra University, 2012
  • Advanced Operating SystemsIqra University, 2013
  • Advanced Analysis of AlgorithmsIqra University, 2010–13

Seminars

  • Model-Driven Software EngineeringUniversity of Bonn, 2016
  • Knowledge GraphsUniversity of Bonn, 2019
  • AI in MedicineUniversity Hospital Cologne, 2024–26
  • Semantic Web and Data StandardizationUniversity Hospital Cologne, 2024–26
  • Research Data ManagementUniversity Hospital Cologne, 2024–26

Bachelor Level

  • Deep Learning in MedicineUniversity Hospital Cologne, 2025–26
  • Python ProgrammingUniversity of Cologne, Fall 2021
  • Introduction to Computer ScienceIqra University, 2010–13
  • Computer Programming (C++, Java)Iqra University / ABACUS, 2010–13
  • Machine LearningIqra University, 2012
  • Object-Oriented ProgrammingIqra University, 2012
  • Data Mining / Data ScienceIqra University, 2011
  • Artificial IntelligenceIqra University, 2013
  • Computer ArchitectureCOMSATS, 2011
  • Information SystemsCOMSATS University, Attock, 2011

Academic Service

Conference & Workshop Organisation

  • Organiser & Track Chair — NORA Workshop on Knowledge Graphs & Agentic Systems Interplay, NeurIPS 2025 and AACL 2026
  • Organiser — AI-Days Conference, University of Cologne, 2026
  • Organiser & Track Chair — International Workshop on Prompt Engineering for Pre-Trained Language Models, The Web Conference 2024, 2025, 2026
  • Organiser & Panel Lead — LASCAR Workshop at ESWC 2019, ESWC 2020
  • Organiser & Presenter — SANSA Tutorial at ESWC 2019, ISWC 2020
  • General Chair — Computer Science Conference for University of Bonn Students (CSCUBS), 2018
  • Senior Adviser — CSCUBS, 2019
  • Organiser — GRADANA School, Bonn, Germany, 2017
  • Organiser — GRADANA School, Thessaloniki, Greece, 2019
  • Organiser — LAMBDA BDA School, Belgrade, Serbia, 2019

Programme Committee Membership

  • ICML 2025
  • SIGIR 2022–2025
  • ESWC 2016–2024
  • AIKE 2022, 2023
  • IEEE ICSC 2020
  • ICACS 2019
  • Data Analytics 2016
  • FIIT 2015
  • ICACC 2015
  • IEEE WCCI 2014
  • IEEE ICCSIT 2011

Journal Reviewing

  • Knowledge-Based Systems
  • Semantic Web Journal
  • Neurocomputing
  • Applied Soft Computing

Invited Talks

  • From Language to Care: Opportunities and Challenges of Using LLMs in Clinical Text — AI-Days, University of Cologne, 2026
  • Transformers Demystified: A Tutorial on Self-Attention — AI-Days, University of Cologne, 2026
  • LLMs Demystified: Opportunities and Challenges in Life Sciences — BullNet Training Week, University of Zurich, 2026
  • Scalable Data Management for AI: From Architectures to Applications — University of Lancaster, 2025
  • Transformer Architectures — Berliner Hochschule für Technik, 2025
  • Data Governance and Data Stewardship — University of Bonn, 2025
  • Methods and Tools for Managing the AI Application Life Cycle — Technische Hochschule Köln, 2024
  • Meet the Experts: LLMs – Opportunities and Challenges for the Social Sciences, 2024
  • Using the SANSA Stack on a 38 Billion Triple Ethereum Blockchain Dataset — SEMANTiCS, 2018
  • Distributed Knowledge Graph Processing — ZEF, University of Bonn, 2018
  • Big Data Europe Platform — European Big Data Value Forum, Paris, 2017
  • Big Data Integrator Platform — Apache Big Data Europe, Seville, 2016
  • Data Classification using Genetic Programming — IT University of Copenhagen, 2014
  • Evolutionary Algorithms for Data Classification — Aarhus University, 2013
  • Genetic Programming and Classification — University of Cardiff, 2010

Awards & Achievements

Professional Certifications