Hajira Jabeen, PhD

~ Data Science Expert at CEPLAS - Cluster of Excellence on Plant Sciences ~

University of Cologne

Working as Data Science Expert at CEPLAS and an associate researcher at Smart Data Analytics . My research interests are distributed analytics, semantic web, data mining, big data and FAIRification of data. I have worked on several H2020 funded projects related to big data, scalable architecture design, and distributed analytics, in several domains inluding maritime, energy, food, smart cities and, now, plant science.
I am interested in organizational activities, teaching, research, and industrial collaborative projects.
Photo
Conferences Organization
General Chair
  1. Workshop on Large Scale RDF Analytics, LASCAR 2020
  2. Workshop on Large Scale RDF Analytics, LASCAR 2019
  3. Computer Science Conference for University of Bonn Students, CSCUBS 2018
Senior Adviser
  1. Computer Science Conference for University of Bonn Students, CSCUBS 2019
Special Schools Organization
  1. GRADANA school, Bonn, Germany 2017
  2. GRADANA school, Thessaloniki, Greece 2019
  3. LAMBDA BDA school, Belgrad, Serbia, 2019
Journal Reviewer
  1. Knowledge Based Systems
  2. Semantic Web Journal
  3. Neurocomputing
  4. Applied Soft Computing
Program Committee Member
  1. IEEE International Conference on Semantic Computing, ICSC 2020
  2. Extended Semantic Web Conference, ESWC 2020
  3. International Conference on Advancements in Computational Sciences ICACS-2019
  4. Data Analytics 2016
  5. Extended Semantic Web Conference ESWC 2016
  6. International Conference on Collection, handling, and documentation of digital forensics ICACC-2015
  7. International Conference on Frontiers of Information Technology FIIT 2015
  8. International Conference on Anti-Cybercrimes, ICACC-2015
  9. IEEE World Congress on Computational Intelligence WCCI 2014
  10. IEEE International Conference on Computer Science and Information Technology, IEEE-ICCSIT-2011
Invited Talks
  1. Using the SANSA Stack on a 38 Billion Triple Ethereum Blockchain Dataset, “SEMANTiCS”, 2018
  2. Distributed Knowledge Graph Processing, at ZEF, University of Bonn, 2018
  3. Big Data Europe Platform at “European Big Data Value Forum” , Paris, France, 2017
  4. Big Data Integrator Platform at “Apache Big Data Europe”, Seville, Spain, 2016
  5. Data Classification using Genetic Programming, at IT University of Copenhagen, Denmark, 2014
  6. Evolutionary Algorithms for Data Classification, at Aarhus University, Denmark, 2013
  7. Genetic Programming and Classification, at “University of Cardiff”, Cardiff, England, 2010
  8. Many Webinars for Big Data Europe
PhD
  1. Theoretical Computational Intelligence
  2. Data Mining
Master Theses
  1. Distributed Big Data Analytics
  2. Knowledge Graph Analytics
  3. Advance Data Mining
  4. Advance Artificial Intelligence
  5. Advance Operating Systems
  6. Advance Computer Architecture
  7. Advance Analysis of Algorithms
Seminars
  1. Model Driven Software Engineering
  2. Knowledge Graphs
Bachelor
  1. Introduction to Computer Science
  2. Computer Programming ( C++, Java)
  3. Web Development
  4. Compute Architecture
  5. Machine Learning
  6. Object Oriented Programming
  7. Data Mining
  8. Artificial Intelligence
  9. Information Systems
  10. Computer Organization and Assembly
    Peer Reviewed Journal Articles
  1. IOTA: Interlinking of Heterogeneous Multilingual Open Fiscal DaTA
    Musyafa, F., Vidal, M, Orlandi, F, Lehmann, J. and H Jabeen, 2019
    Expert Systems With Applications
  2. BioKEEN: A library for learning and evaluating biological knowledge graph embeddings
    M Ali, CT Hoyt, DD Fernandez, J Lehmann, H Jabeen, 2019
    Bioinformatics
  3. OPSODE: Opposition based particle swarm optimization instilled with differential evolution
    Q Abbas, J Ahmad, H Jabeen
    International Journal of Advanced and Applied Sciences, 2017
  4. Tournament selection mechanism based random vector selection in differential evolution algorithm
    Q Abbas, J Ahmad, H Jabeen
    International Journal of Advanced and Applied Sciences, 2017
  5. Random Controlled Pool base Differential Evolution Algorithm (RCPDE)
    Q Abbas, J Ahmad, H Jabeen
    Intelligent Automation & Soft Computing, 2017
  6. SML-Bench:A benchmarking framework for structured machine learning
    P Westphal, L Bühmann, S Bin, H Jabeen, J Lehmann
    Semantic Web, 1-15, 2017
  7. Fitness Proportionate Random Vector Selection based DE Algorithm (FPRVDE)
    Q Abbas, J Ahmad, H Jabeen
    International Journal of Advanced Computer Science and Applications(IJACSA) , 2016
  8. A novel tournament selection based differential evolution variant for continuous optimization problems
    Q Abbas, J Ahmad, H Jabeen
    Mathematical Problems in Engineering, 2015
  9. Two-stage learning for multi-class classification using genetic programming
    H Jabeen, AR Baig
    Neurocomputing, 2013
  10. Two layered Genetic Programming for mixed-attribute data classification
    H Jabeen, AR Baig
    Applied Soft Computing 2012
  11. GPSO: A Framework for Optimization of Genetic Programming Classifier Expressions for Binary Classification using Particle Swarm Optimization
    H Jabeen, AR Baig
    International journal of innovative computing, information and control, 2011
  12. A Review of classification using genetic programming
    H Jabeen, AR Baig
    International journal of engineering science and technology, 2010
  13. DepthLimited Crossover in Genetic Programming for Classifier Evolution
    H Jabeen, AR Baig
    Computers in Human Behaviour, 2010
  14. Book Chapters
  15. Particle Swarm Optimization Based Tuning of Genetic Programming Evolved Classifier Expressions
    H Jabeen, A R Baig
    Studies in Computational Intelligence (SCI). Vol 284, Springer, 2010
  16. Big Data Europe
    H Jabeen, P Archer, S Scerri, A Versteden, I Ermilov, G Mouchakis, J Lehmann and S Auer
    EDBT/ICDT Workshops, 2017
  17. Big Data Outlook, Tools, and Architectures
    H Jabeen
    Knowledge Graphs and Big Data Processing, Springer, 2020
  18. Scalable Knowledge Graph Processing using SANSA
    H Jabeen, D Graux and G Sejdiu
    Knowledge Graphs and Big Data Processing, Springer, 2020
  19. Conference Proceedings
  20. Multilingual Ontology Merging Using Cross-lingual Matching
    S Ibrahim, S Fathalla, J Lehmann, and H Jabeen
    IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2020
  21. OWLStats: Distributed Computation of OWL Dataset Statistics
    H Allah, S Fathalla, J Lehmann, and H Jabeen
    IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 2020
  22. Metadata standards for the FAIR sharing of vector embeddings in Biomedicine
    S ̧Kafkas, R Celebi, M Ali, H Jabeen, M Dumontier and R Hoehndorf
    Bio-Ontologies, 2020
  23. AutoChef: Automated Generation of Cooking Recipes
    H Jabeen, J Weinz and J Lehmann
    IEEE Congress on Evolutionary Computation (IEEE CEC), 2020
  24. Cross Administration Comparative Analysis of Open Fiscal Data
    F Musyafa, J Lehmann and H Jabeen
    International Conference on Theory and Practice of Electronic Governance, 2020
  25. DISE: A Distributed in-Memory SPARQL Processing Engine over Tensor Data
    H Jabeen, E Hazeiv, G Sejdiu and J Lehmann
    IEEE International Conference on Semantic Computing, 2020
  26. Affinity Dependent Negative Sampling forKnowledge Graph Embeddings
    M Baig, H Jabeen, M Ali, J Lehmann
    Deep Learning for Knowledge Graphs, 2020
  27. Uniform Access to Multiform Data Lakes using Semantic Technologies
    MN Mami, D Graux, S Scerri, S Auer, H Jabeen and J Lehmann
    International Conference on Information Integration and Web-based Applications & Services (iiWAS), 2019
  28. Towards A Scalable Semantic-based Distributed Approach for SPARQL query evaluation
    Gezim Sejdiu, Damien Graux, Imran Khan, Ioanna Lytra, Hajira Jabeen, Jens Lehmann
    International Conference on Semantic Systems, 2019
  29. From Monolingual to Multilingual Ontologies: The Role of Cross-Lingual Ontology Enrichment
    S Ibrahim, S Fathalla, HS Yazdi, J Lehmann, H Jabeen
    International Conference on Semantic Systems, 2019
  30. Scalable Distributed Genetic Algorithm Using Apache Spark (S-GA)
    F Maqbool, S Razzaq, J Lehmann, H Jabeen
    International Conference on Intelligent Computing, 2019
  31. Squerall: Virtual Ontology-Based Access to Heterogeneous and Large Data Sources
    MN Mami, D Graux, S Scerri, S Auer, H Jabeen and J Lehmann
    International Semantic Web Conference, 2019
  32. A Scalable Framework for Quality Assessment of RDF Datasets
    G Sejdiu, A Rula, J Lehmann and H Jabeen
    International Semantic Web Conference, 2019
  33. The KEEN Universe: An Ecosystem for Knowledge Graph Embeddings with a Focus on Reproducibility and Transferability
    M Ali, H Jabeen,CT Hoyt,and J Lehmann
    International Semantic Web Conference, 2019
  34. EvoChef: Show Me What to Cook! Artificial Evolution of Culinary Arts
    H Jabeen, N Tahara, J Lehmann
    International Conference on Computational Intelligence in Music, Sound, Art, 2019
  35. Divided we stand out! forging cohorts for numeric outlier detection in large scale knowledge graphs (conod)
    H Jabeen, R Dadwal, G Sejdiu, J Lehmann
    European Knowledge Acquisition Workshop, 2019
  36. OpenBudgets. eu: A Platform for Semantically Representing and Analyzing Open Fiscal Data
    FA Musyaffa, L Halilaj, Y Li, F Orlandi, H Jabeen, S Auer, ME Vidal
    International Conference on Web Engineering, 433-447
  37. Classifying data heterogeneity within budget and spending open data
    FA Musyaffa, F Orlandi, H Jabeen, ME Vidal
    Proceedings of the 11th International Conference on Theory and Practice of
  38. Managing lifecycle of big data applications
    I Ermilov, ACN Ngomo, A Versteden, H Jabeen, G Sejdiu, G Argyriou, ...
    International Conference on Knowledge Engineering and the Semantic Web, 263-276
  39. Distributed semantic analytics using the sansa stack
    J Lehmann, G Sejdiu, L Bühmann, P Westphal, C Stadler, I Ermilov, S Bin, ...H Jabeen
    International Semantic Web Conference, 2017
  40. The BigDataEurope platform;supporting the variety dimension of big data
    S Auer, S Scerri, A Versteden, E Pauwels, A Charalambidis, ...
    International Conference on Web Engineering, 2017
  41. Big data analytics for behavior monitoring of students
    AR Baig, H Jabeen
    Procedia Computer Science 82, 43-48
  42. Multiclass Classification using Genetic Programming
    International Conference on Knowledge Management (ICKM 2012)
    H Jabeen, AR Baig, J Ahmed
  43. Lazy learning for multi-class classification using genetic programming
    H Jabeen, AR Baig
    International Conference on Intelligent Computing, 177-182
  44. CLONAL-GP framework for artificial immune system inspired genetic programming for classification
    H Jabeen, AR Baig
    International Conference on Knowledge-Based and Intelligent Information and
  45. A framework for optimization of genetic programming evolved classifier expressions using particle swarm optimization
    H Jabeen, AR Baig
    International Conference on Hybrid Artificial Intelligence Systems, 56-63
  46. Opposition based PSO and mutation operators
    M Imran, H Jabeen, M Ahmad, Q Abbas, W Bangyal
    2010 2nd International Conference on Education Technology and Computer 4, V4 … 12
  47. Word length based zero-watermarking algorithm for tamper detection in text documents
    Z Jalil, AM Mirza, H Jabeen
    2nd International Conference on Computer Engineering and Technology 2010
  48. Sponsor-based-architecture for resource management in multi-agent systems
    Z Jalil, H Jabeen
    IADIS Multi Conference on Computer Science and Information Systems(2007)
  49. Opposition based initialization in particle swarm optimization (O-PSO)
    H Jabeen, Z Jalil, AR Baig
    Genetic and Evolutionary Computation Conference (GECCO 2009).
  50. Posters and Demos
  51. EvoChef: Show me What to Cook! Artificial Evolution of Culinary Arts
    H Jabeen,N Tahara and J Lehmann
    International Conference on Computational Intelligence in Music, Sound, Art and Design
  52. Smart Chef - Evolving Recipes
    H Jabeen, C Drachner and J Lehmann
    International Conference on Computational Intelligence in Music, Sound, Art and Design
  53. Querying Data Lakes using Spark and Presto
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer
    WWW 2019
  54. LAMBDA: Learning, Applying, Multiplying Big Data Analytics
    V Janev, J Lehmann, E Sallinger, S Vahdati, D Graux, and H Jabeen
    European Semantic Web Symposium (ESWS 2019)
  55. Clustering Pipelines of large RDF POI Data
    R Dadwal, D Graux, G Sejdiu, H Jabeen, J Lehmann
    European Semantic Web Symposium (ESWS 2019)
  56. OECM: A Cross-lingual Approach for Ontology Enrichment
    Shimaa Ibrahim, Said Fathalla, Hamed Shariat Yazdi, Jens Lehmann and Hajira Jabeen
    European Semantic Web Symposium (ESWS 2019)
  57. Profiting from Kitties on Ethereum: Leveraging Blockchain RDF Data with SANSA
    D Graux, G Sejdiu, H Jabeen, J Lehmann, D Sui, D Muhs, J Pfeffer
    Semantics, 2018
  58. Efficient Data Parsing and Vandalism Detection on (Big) Knowledge Bases using Apache Spark
    N Roqaya, H Jabeen, J Lehmann
    Computer Science Conference for University of Bonn Students, 2018
  59. The Tale of Sansa Spark
    I Ermilov, J Lehmann, G Sejdiu, L Bühmann, P Westphal, C Stadler, S Bin, ...
    International Semantic Web Conference, 2017
  60. Big Data Platform for empowering communities including life sciences
    H Jabeen, J Lehmann
    Conference for Computational Bioscience Research Center, 2015
  61. How to feed the Squerall with RDF and other data nuts?
    Mohamed Nadjib Mami, Damien Graux, Simon Scerri, Hajira Jabeen, Sören Auer, Jens Lehmann
    ISWC (Posters and Demos), 2019
  62. Smart Chef - Evolving Recipes
    H Jabeen, C Drachner and J Lehmann
    International Conference on Computational Intelligence in Music, Sound, Art and Design
  63. Interroger des Lacs de Données en utilisant Spark & Presto
    M N Mami, D Graux, S Scerri, H Jabeen and S Auer
    BDA (Demo Track), 2019
  64. Non Reviewed Reports
  65. ‘Teach me to fish’Querying Semantic Data Lakes
    MN Mami, H Jabeen, S Auer
2020-Present, Senior Researcher & Data Scientist University of Cologne
Working as the Data Science Expert at the Center of Excellence in Plant Sciences
Key research areas: FAIR Research Data Management, Big Data, Knowledge Graphs, Machine Learning, and Project Management
2016-2020, Data Scientist and Research Group Leader University of Bonn
Key research areas
Data Science, Parallel and Distributed Analytics, Knowledge Graphs, Data Mining, Machine Learning, Graph Embeddings, Artificial Intelligence, Semantic Web and related areas
Projects and Management
Technical WP Lead in H2020 Big Data Ocean,
Co-PI and Team member of HSP project Smoothed analysis of Machine Learning Algorithms
Lecturing and tutoring in Gradana, Lambda, Cleopetra
HSP for Distributed Big Data Analytics
Worked as a technical lead at Horizon 2020 funded project Big Data Europe, on the creation of “Big Data Integrator Platform”.
Machine Learning
Provision of technical expertise in projects and use cases for Big Data Ocean, Better, SLIPO, SPECIAL, OpenBudget, Bio2Vec and Blockchain analytics support for Alethio.
Supervision of PhD theses
1. Scalable Distributed Terminological Decision Trees for RDF, Heba Allah, University of Bonn.
2. Distributed Knowledge Fusion in Knowledge Graphs, Shemaa Khalid
3. Multilingual and Heterogenous Fiscal Data Integration, Fathoni Musyafa
4. Scalable processing over large knowledge Graphs, Gezim Sejdiu
Teaching
1. Distributed Big Data Analysis
2. Knowledge Graph Analysis
2015-2016, PostDoctoral Researcher Leipzig University
Worked as the work package lead at Horizon 2020 funded project Big Data Europe in development of a multi-purpose, open-source and scalable platform that is easy to use by communities.
Research in Description Logics, Structured Machine Learning and Semantic Web using Big Data tools like Spark, Flink, Dockers etc.
2014-2015, Assistant Lecturer IT University of Copenhagen
Shared the teaching responsibilities of the assigned courses to prepare and carry seminars, and go through the exercises. I have participated in following courses : 1. Software Architecture, 2. Data Mining. I remained active member of 'GameAI' and 'Real' research groups and worked with Monte Carlo Tree Search Algorithms, Procedural Game development and Evolutionary Algorithms for Games and Arts.
2013-2014, Software Engineer TEO Intl A/S, Copenhagen
Worked in a team-project for accelerated global team building solution.
2012-2013, Head of the Department IQRA University
The department of ‘Computing and Technology’ has more than 20 employees and about 2000 students. I have efficiently delivered and managed multitude of disciplines in my tenure as the Head of the Department.
2009-2013 , Assistant Professor IQRA University
I have taught a variety of subjects at the Undergraduate, Graduate and PhD level. The peer review and student assessment of my courses have always been outstanding.
Project [Role] Abstract Date
Big Data Europe
[Technical Leader]
Technical lead for creation of “Big Data Integrator Platform”. 2015-2018
Big Data Ocean
[Technical Leader]
Technical Lead for the data harmonization and platform for maritime data 2016-2019
OpenBudget
[ML expert]
Technical advisor for platform development and multimodal data analytics  
LAMBDA
[Academic expert]
Knowldge exchange expert and educationist
LAMBDA defines a scientific strategy for stepping up and stimulating scientific excellence and innovation capacity, increasing research capacities and unlocking the research potential of the biggest and the oldest R&D Institute in the ICT area in the whole West Balkan region, turning the Institute Mihajlo Pupin into a regional point of reference when it comes to multidisciplinary ICT competence related to Big Data analytics.
2018-2020
Gradana
[Academic Expert]
Knowldge exchange expert and educationist 2017-2019
Better
[ML Consultant]
Machine Learning and Analytics consultant
BETTER is implementing a Big Data intermediate service layer focused on creating user-centric services and tools, while addressing the full data lifecycle associated with EO data, to bring more downstream users to the EO market and maximise exploitation of Copernicus data and information services.
 
SLIPO
[ML Consultant]
Machine Learning and Analytics consultant
SLIPO develops software, models and processes for: transforming conventional POI formats and schemas into RDF data; interlinking POI entities from different datasets; enriching POI entities with additional metadata, including temporal, thematic and semantic properties; fusing Linked POI data in order to produce more complete and accurate POI profiles; assessing the quality of the integrated POI data; offering value added services based on spatial aggregation, association extraction and spatiotemporal prediction.
 
Cleopetra
[Academic Expert]
Knowldge exchange expert and educationist 2019-2022
Bio2Vec
[Co PI]
Technical lead 2017-2020
PLATOON
[Co PI]
Technical lead 2020-2022
PhD
  1. Scalable Distributed Terminological Decision Trees for RDF, Heba Allah, University of Bonn, Germany
  2. Distributed Knowledge Fusion in Knowledge Graphs, Shemaa Khalid, University of Bonn, Germany
  3. Scalable Multilingual and Heterogenous Fiscal Data Integration, Fathoni Musyafa, University of Bonn, Germany
  4. Distributed RDF processing using Apache Spark, Gezim Sejdiu, University of Bonn, Germany
  5. Performance Analysis of Evolutionary Computation Techniques for Continuous Optimization Problems, Qamar Abbas, Iqra University, Islamabad. 2016
Master Theses
  1. Smart Chef - Evolving Recipes, Carsten Felix Draschner, 2019
  2. Automated Link Discovery for Data Harmonization in the Maritime Domain, Jaime Manuel Trillos, 2019
  3. Scalable Entity Resolution, Amrit Kaur, 2019
  4. Distributed in-memory SPARQL Processing Haziiev Eskender, 2019
  5. Scalable RDF Clustering, Pratik Kumar Agarwal, 2019
  6. Rule Mining on Distributed RDF Data, Kunal Jha, 2018
  7. EvoChef! Teach me how to cook..Machine Generated Recipes Using Evolutionary Algorithm, Nargis Tahara, 2018
  8. Distributed RDF Clustering Framework, Tina Boroukhian, 2018
  9. Distributed Data Parsing and Vandalism Detection on Large-scale Knowledge Graphs Using Apache Spark and Hadoop Ecosystem, Nayef Fayez Roqaya, 2018
  10. Scalable Numerical Outlier Detection in Knowledge Graphs, Rajjat Dadwal, 2018
  11. Association Rule Mining of Linked Data Using Apache Spark, Theresa Nathan, 2017
  12. Reserving Algorithms and Portfolio Analysis for Life Insurance using Apache Spark, Amir Ansari, 2017 (SCOR, Cologne)
  13. Extraction and Fusion of Identity Data, Yujie Diao, 2016 (SMS group GmbH)
  14. Developing a believable MCTSagent in a real-time platform shooter, Lasse Knudsen, Lasse Joergensen 2015
  15. Comprehensive Rule Discovery Using Differential Evaluation For Breast Cancer Survival, Zubair Farooq, 2014
  16. Comprehensive Rules Discovery Using Particle Swarm Optimization For Back Pain Diagnosis, Hasan Kamal, 2013
  17. Fuzzy Logic Based Intelligent Software Requirement Elicitation Technique Selection Model, Wajid Arshad Abbasi, 2013
  18. A modified decreasing inertia weight in Particle Swarm Optimization (PSO), Muhammad Husnain, 2013
  19. Modified Particle Swarm Optimization (PSO) with Laplace Mutation Operator (LMPSO), Muhammad Imran, 2010