Student Theses

Open thesis topics:

Topic 1: Conducting of a Causal Analysis for Sustainable Transportation Based on a Case Study

Background

Currently, we have two datasets that means there are two possible case studies to select only one of them.

The objective of the first possible case study is to investigate the variables that contribute to bike sharing becoming a practicable and environmentally friendly form of transportation. The field of causal inference and discovery is wide. Therefore, we will mainly conduct time-based causal analysis and causal discovery in this thesis work.

In the first case, examples for possible interventions like change in the ticket price, big economy change in the world, COVID, new marketing decision, big event in the city. Similarly, examples for possible cause factors (no intervention) are like the weather, the ticket price, the car petrol price, the university is close.

For another possible case study, the input data is provided by a national railway company. Data related to train traffic and the number of passengers. By supplementing the data, it is possible to study how, for example, the opening of a nearby factory changed travel habits, or what effect a new, modern train has on the traveling public. Data is also available on the trains, so e.g. delays also can be analyzed.

Main steps:

  • Causal discovery algorithms used in industry (literature research)
  • Algorithms for causal discovery and their implementations (literature research)
    • Including the introduction and comparison of the available libraries
    • (Emphasize the distinctions between causal impact analysis and causal discovery)
  • Case study
    • Input data preparation, visualization
    • Formulating the questions we want to answer
    • Library selection, implementation, verify the result
    • Explain/understand the results
  • One possible case study
  • Another possible case study
  • Evaluation of the case study

During implementation, Python is the preferred language. We already looked at one of the Python versions of Google’s R package for causal invention analysis, called Causalimpact. Similarly, we have already employed the Tigramite Python package for causal discovery (i.e., while we are examining no intervention).

This work will also be a part of a PhD work, so the achieved results will be published together with it too.

References

Reading about the bike sharing in the circular economy:

Henriksson, M. and Scalzotto, J.G., 2023. Bike-sharing under pressure: The role of cycling in building circular cycling futures. Journal of Cleaner Production, 395, p.136368.

Reading about the causal discovery in public transport:

Zhang, N., Graham, D.J., Hörcher, D. and Bansal, P., 2021. A causal inference approach to measure the vulnerability of urban metro systems. Transportation, pp.1-32.

Contact

Tamas Fekete
tfekete@constructor.university


Topic 2: Unlocking Circular Economy Potential: The Role of Digital Product Passports (DPP) in Enhancing Resource Efficiency throughout Supply Chain

Background:
This thesis seeks to explore the transformative role of digital product passports (DPP) in the context of circular economy principles. In the face of escalating environmental concerns and resource scarcity, circular economy practices have emerged as a promising solution to decouple economic growth from resource depletion. Digital product passports, a concept gaining traction in various industries, present an innovative approach to enhance traceability, transparency, and sustainability across the product lifecycle. This research aims to examine the implications, challenges, and benefits of implementing digital product passports as a tool to facilitate circularity. Through an in-depth analysis of existing literature, case studies, and empirical data, this study aims to shed light on the potential of digital product passports in revolutionizing supply chain management and consumer behavior toward a more circular and sustainable economy.

Objectives (Customized)

  • Investigate the foundational principles of circular economy and the current state of circular practices in industries.
  • Examine the concept of digital product passports and their role in enhancing transparency and traceability throughout the product lifecycle.
  • Analyze the impact of digital product passports on promoting resource efficiency, reducing waste, and fostering sustainable consumption patterns.
  • Evaluate the challenges and barriers associated with the implementation of digital product passports, including technological, regulatory, and economic considerations.
  • Explore case studies and best practices from industries that have successfully adopted digital product passports to promote circularity.
  • Propose recommendations for the strategic integration of digital product passports into various sectors to maximize their contribution to circular economy goals.

Methodology

This research will employ a combination of systematic literature review, case studies, and empirical analysis. The literature review will provide a comprehensive understanding of circular economy principles, digital product passports, and their intersection. Case studies will be utilized to examine real-world examples of successful implementations, while empirical analysis through surveys or interviews will offer insights into stakeholder perspectives and challenges associated with adopting digital product passports.

Significance

The findings of this thesis will contribute to the growing body of knowledge on circular economy practices and the potential of digital product passports in advancing sustainability goals. The research outcomes can inform policymakers, industry professionals, and consumers about the benefits and challenges of integrating digital product passports into supply chains. Ultimately, this study aims to provide a foundation for further research and guide the strategic adoption of digital product passports as a tool for promoting resource-efficient, circular economic systems.

References

https://doi.org/10.3390/en14082289

https://www.sciencedirect.com/science/article/pii/S221282712200021X

https://www.circularise.com/blogs/digital-product-passports-dpp-what-how-and-why


Topic 3: Revolutionizing Mobility: Unveiling the Potential of Artificial Intelligence in Cooperative, Connected, and Automated Mobility (CCAM)

Background
This thesis endeavors to explore the transformative role of Artificial Intelligence (AI) in shaping the landscape of Cooperative, Connected, and Automated Mobility (CCAM). As the transportation sector undergoes a paradigm shift towards interconnected and autonomous systems, AI emerges as a critical enabler in realizing the full potential of CCAM. This research aims to delve into the multifaceted contributions of AI technologies, encompassing machine learning algorithms, data analytics, and decision-making models, in enhancing the efficiency, safety, and sustainability of cooperative and connected transportation networks. Through an extensive examination of literature, case studies, and empirical data, the study seeks to unravel the complexities and potentials of AI in the evolution of CCAM.

Objectives

  • Investigate the current state of Cooperative, Connected, and Automated Mobility, identifying key challenges and opportunities in the integration of AI technologies.
  • Examine the specific roles of AI in CCAM, including machine learning algorithms for intelligent decision-making, data analytics for predictive maintenance, and communication protocols for seamless connectivity.
  • Assess the impact of AI-driven solutions on enhancing traffic management, safety, and overall performance within cooperative and connected transportation systems.
  • Analyze the integration of AI with communication technologies such as V2X (Vehicle-to-Everything) for real-time coordination and information exchange.
  • Evaluate the ethical considerations, regulatory frameworks, and societal implications surrounding the deployment of AI in CCAM.
  • Explore case studies and real-world applications where AI has demonstrated significant advancements in improving efficiency and safety within CCAM.

Methodology

This research will employ a mixed-methods approach, combining a systematic literature review with case studies and empirical analysis. The literature review will synthesize existing knowledge on CCAM and the application of AI. Case studies will be used to examine real-world implementations, while empirical analysis through simulations or field studies will provide insights into the practical performance and challenges associated with AI-driven CCAM systems.

Significance

The findings of this thesis will contribute to the ongoing discourse on the convergence of AI and CCAM, providing valuable insights for researchers, policymakers, and industry stakeholders. The research outcomes aim to inform the development and deployment of AI technologies to optimize the cooperative, connected, and automated mobility landscape. Ultimately, this study strives to facilitate the creation of more intelligent, efficient, and sustainable transportation systems through the strategic integration of AI within the context of CCAM.

Reference

https://www.mdpi.com/2071-1050/13/19/10610


Topic 4: Digital Technologies Transforming Sustainable Forest Management: A Comprehensive Exploration

Background

This thesis aims to investigate and analyze the pivotal role of digital technologies in advancing sustainable forest management practices. As the world grapples with environmental challenges, there is a growing recognition of the need to balance economic development with ecological preservation. Digital technologies offer unprecedented opportunities to enhance the efficiency, precision, and sustainability of forest management processes. This research delves into the multifaceted impact of digital technologies, such as Geographic Information Systems (GIS), remote sensing, Internet of Things (IoT), and artificial intelligence, on the sustainable management of forests. Through a systematic review of existing literature, case studies, and empirical data, the study aims to contribute insights into the transformative potential of digital technologies in addressing contemporary challenges and fostering long-term ecological resilience.

Objectives

  • Investigate the current state of sustainable forest management practices and identify existing challenges.
  • Analyze the role of digital technologies, including GIS, remote sensing, IoT, and artificial intelligence, in addressing challenges and optimizing forest management processes.
  • Explore the environmental, economic, and social implications of integrating digital technologies into sustainable forest management.
  • Assess the effectiveness of existing digital tools and technologies in promoting transparency, accountability, and stakeholder engagement in forest management.
  • Examine case studies from different regions to highlight successful applications of digital technologies in sustainable forest management.
  • Propose recommendations and guidelines for the strategic implementation of digital technologies to enhance sustainability in forest management practices.

Methodology

This research will adopt a mixed-methods approach, combining a systematic literature review with case studies or interview. The literature review will synthesize existing knowledge on sustainable forest management and the application of digital technologies. I provide practical insights into the real-world impact of digital technologies on sustainable forest management.

Significance

The findings of this thesis will contribute to the academic discourse on the intersection of digital technologies and sustainable forest management. Additionally, the research outcomes can inform policymakers, forest managers, and technology developers on the potential benefits and challenges associated with integrating digital solutions into forestry practices. Ultimately, the study aims to provide a foundation for further research and guide the strategic adoption of digital technologies to foster sustainable and resilient forest ecosystems.

 


Topic 5: Conceptual model of platform for digital asset management to enable cross-company collaborative digital twin development for transport infrastructure management

Background

In the transport sector, implementing digital twins is part of the digitization measure to improve resource efficiency in infrastructure management. However, the use of digital twins is still limited due to challenges such as the lack of a shared understanding of digital twin models, complex model integration, security issues, lack of access to essential data, and high costs due to inefficient business models. A platform for asset management platform will enable cross-company collaborative development and use of digital twins (DC). The platform aims to reduce the cost and effort of creating and using digital twins that improve transparency in the planning and operation of transport infrastructures.

Research Questions

  • What transport infrastructure management scenarios require digital twin solutions?
  • What are the essential components of digital asset management platform for digital twin?
  • Which assets and data sources are required?

Tasks

  • Analysis of existing digital twin platform solutions: literature review
  • Analysis of typical transport infrastructure management scenarios including actors, use-cases, digital asset and data requirements: literature review
  • Identification of essential components of the platform
  • Development of the conceptual model
  • Validation with experts

References

BMVI (2021) Masterplan BIM Bundesfernstraßen – Digitalisierung des Planens, Bauens, Erhaltens und Betreibens im Bundesfernstraßen-bau mit der Methode Building Information Modeling (BIM) [online].

https://aws.amazon.com/de/iot-twinmaker/

https://docs.microsoft.com/en-us/azure/digital-twins/overview

https://www.bentley.com/en/products/product-line/digital-twins/itwin

https://vertexvis.com/products/vertex-digital-twin-platform


Topic 6: Analysis and development of a standardized asset model for digital twin for transport infrastructure management

Background

The introduction of DZ in companies in Germany, Austria and Switzerland is still limited. Although 54% of companies have an end-to-end strategy for DC, only 8% of companies are already fully utilizing their DC. The central challenges are understanding the data and integrating models of the digital assets (DA) of individual DCs in a simulation of the overall system. For SMEs, the questions and uncertainties lie in the business models, costs, security aspects and access to essential data/information, among other things. Therefore, having a standardized information model expressing the standard terms, taxonomy, and relations of assets is essential.

Research Questions

  • What transport infrastructure management scenarios require digital twin solutions?
  • Which assets and data sources are required?
  • What standardized information model in transport infrastructure management exists?
  • How do we express standardized terms, taxonomy, and relations the best?

Tasks

  • Analysis of typical transport infrastructure management scenarios including actors, use-cases, digital asset and data requirements: literature review
  • Analysis of existing standards for transport infrastructure management: literature review
  • Development of asset taxonomy for digital twin for transport infrastructure management
  • Formalization in OWL/RDF
  • Validation

References

https://learn.microsoft.com/en-us/azure/digital-twins/concepts ontologies

https://protege.stanford.edu/publications/ontology_development/ontology101.pdf

https://www.iaarc.org/publications/fulltext/ISARC_2019_Paper_117.pdf

https://www.researchgate.net/publication/280949391_An_ontology_based_approach_to_traffic_management_in_urban_areas


Topic 7: Object detection and semantic annotation from 3D images of digital twin assets for transport infrastructure management using machine learning

Background

One of the typical data formats of digital twin assets is 3D images/drawings, e.g. car, bridge, and train models. These 3D images have to be integrated into the digital twin platform by annotating and linking the 3d images to the semantic structure. Thus, objects in the 3D images have to be automatically detected based on labels/terms defined in the semantic model/ontology.

Research Questions

  • What object detection method is the best to detect objects from 3D drawings?
  • How can we integrate the detected object into the digital twin of transport management ontology?

Tasks

  • Finding the ontology (creating a simple ontology) as the source of labelling
  • Image/drawing selection and image engineering/manipulation to enrich the dataset
  • Labelling
  • Feature engineering
  • Object detection using e.g. YOLO
  • Validation

Topic 8: Development of metrics to measure the explainability of machine learning models for digital twin

Background

Extracting explainable insights from the data managed by digital twins will improve the trust of the full spectrum of stakeholders in transport infrastructure management. This can be achieved through explainable AI (XAI). However, a metric is required to assess the quality and degree of the explainability of XAI.

Research Questions

  • What criteria are required to assess the explainability of machine learning?
  • Are the criteria the same for all stakeholders?
  • How can we calculate the score of explainability based on the identified criteria?

Tasks

  • Identification of stakeholders in transport infrastructure management and their level of knowledge about AI
  • Literature review on explainability measurement/assessment
  • Selection of measurement/assessment criteria
  • Development of calculation model
  • Validation

References

https://arxiv.org/abs/1812.04608


Topic 9: Implementing Explainable DL models for sentiment analysis in the Food Delivery Services

Research Questions

How to apply Explainable Deep Learning models for Sentiment Analysis in the Food Delivery Services ?

Tasks

  • Literature Review about Explainable Deep Learning and Sentiment Analysis
  • Exploratory Food Delivery Data Analysis
  • Model Comparison between Conventional ML/AI and Explainable AI or Interpretable ML
  • Result Analysis

References

Adak, A., Pradhan, B., & Shukla, N. (2022). Sentiment Analysis of Customer Reviews of Food Delivery Services Using Deep Learning and Explainable Artificial Intelligence: Systematic Review. Foods, 11(10), 1500.
Drus, Z.; Khalid, H. Sentiment Analysis in Social Media and Its Application: Systematic Literature Review.Procedia Comput. Sci.2019,161, 707–714
Johnson, R.; Zhang, T. Effective Use of Word Order for Text Categorization with Convolutional Neural Networks.arXiv 2014,arXiv:1412.1058.
Moreno Lopez, M.; Kalita, J. Deep Learning Applied to NLP.arXiv2017,arXiv:1703.03091.
Pang, B.; Lee, L.; Vaithyanathan, S. Thumbs Up? Sentiment Classification Using Machine Learning Techniques.arXiv2002,arXiv:cs/0205070.

Contact:

Rahmat Hidayat
r.hidayat@jacobs-university.de


Topic 10: Anomaly Detection in Building Energy-Saving Information Systems Using Internet of Things and Explainable Agent-Based Algorithms

Research Questions

How to apply Explainable Agent-Based Algorithms in Building Energy Saving Information System?

Tasks

  • Literature Review
  • Exploratory Data Analysis
  • Model Comparison
  • Result Analysis

References

Forestiero, A. Metaheuristic Algorithm for Anomaly Detection in Internet of Things leveraging on a Neural-driven Multiagent System. Knowl.-Based Syst.2021,228, 107241.
Forestiero, A.; Mastroianni, C.; Spezzano, G. Reorganization and Discovery of Grid Information with Epidemic Tuning. Future Gener. Comput. Syst. 2008,24, 788–797
Forestiero, A.; Papuzzo, G. Agents-Based Algorithm for a Distributed Information System in Internet of Things.IEEE Internet Things J.2021,8,

Contact:

Rahmat Hidayat
r.hidayat@jacobs-university.de

 


Topic 11: Ontology and Knowledge Graph Development for Sustainability/ESG* Performance Measurement in Manufacturing

*ESG: Environment, Social, Governance

Background
The complexity of the sustainability aspects in manufacturing has grown that include factors in the environment, social, and governance (ESG). As a comprehensive sustainability report is mandatory and must meet certain minimum requirements, failure to maintain sustainability aspects can lead to potential issues, e.g. failure of risk management and incompatibility with regulations that fail business activities.

Research Questions

  • How are current sustainability performances measured in the manufacturing industry?
  • How to build an Ontology and Knowledge Graph (KG) to help manage ESG performance measurement and solve interoperability issues in manufacturing?

Tasks

The whole tasks are estimated will be finished in March 2023, that consists of:

  • Develop ontology and KG related to ESG performance measurement.
  • Utilize and analyze current sustainability performance reports and extract information using NLP and Deep learning. The candidate tool for text extraction is already available.
  • Map and align extracted information to ontology and KG.
  • Put data from data sources to KG.
  • Visualize knowledge graph results, e.g. using Neo4j, Stardog, etc.

The work progress will manage by using a management project tool named JIRA.

References

https://www.sciencedirect.com/science/article/pii/S0167739X2033003X?via%3Dihub

Contact:

Annas Vijaya
a.vijaya@jacobs-university.de


 

Topic 12: Knowledge graph building for artificial intelligence for better decisions in manufacturing

Research Questions and Tasks

  • What are the knowledge graphs, and how do you construct them?
  • Search/introduce companies where the knowledge graphs are applied (main focus is the automotive industry)
  • How can knowledge graphs and AI be used together?
  • Chose a (simple) problem for a case study (earlier points can help)
  • Search and compare AI algorithms/techniques focusing on the earlier introduced case study/problem
  • Formulate suggestions on which algorithm/technique can use to solve the selected case study better, design the way
  • This topic is based on literature research and design
  • Programming is not required (kind of black box)
  • This topic is feasible for Bachelor students

Contact:

Tamas Fekete
t.fekete@jacobs-university.de


 

Topic 13: Comparing and analysing different ontology mapping tools

Background
Ontologies are a convenient way for knowledge representation. When ontologies are involved in system integration, it requires communication between two or more ontologies. Data to-ontology Mapping is the process whereby data and ontology are semantically related at a conceptual level, i.e. correspondences are established between the data source components and the ontology components. There are some tools for different mapping formats of data into ontologies. The goal here is to compare and evaluate the existing mapping tools.

Research Questions

  • What ontology mapping tools exist?
  • How do these tools perform?
  • What methods were these tools used?

Tasks

  • Finding all the mapping tools that are exist.
  • Comparing and analysing the mapping tools.
  • Determining the techniques that these tools are used.

 

References

[1] https://link.springer.com/chapter/10.1007/978-3-319-25639-9_24.

[2] https://arxiv.org/pdf/1906.08092.pdf.

Contact:

Tina Boroukhian
t.boroukhian@jacobs-university.de


Topic 14: Causal machine learning/inference to identify the causality of factors driving people doing voluntary carbon offsetting.

Background
Voluntary carbon offsetting is an approach to compensate activities that cause intensive greenhouse gas emissions and cannot be avoided, for example, intercontinental flying, package shipping, moving trucks, etc. The compensations are done by offsetting the carbon emission to fund projects that reduce greenhouse gas emissions, for example, installation of renewable energy power plants, electrification of transportation, the introduction of efficient cookstoves, etc. Therefore, a global net-zero emission can be achieved. This thesis focuses on empirical research and should apply causal machine learning which is a novel approach in AI.

Research Questions

  • What factors contribute to people who want to do voluntary carbon offsetting for their activities?
  • How are the causal relationships among those factors based on collected dataset?

Tasks

  • Identify potential factors in the literatures that drive the intentions of people to contribute to global net-zero emission
  • Develop hypotheses and formalize causal relationship e.g. using causal graph.
  • Design a survey to test the hypotheses
  • Collect the data through a survey
  • Causal machine learning to analyze the collected data using tools such as DoWhy, ShowWhy
  • Description and discussion of the results

References

https://ideas.repec.org/a/eee/touman/v45y2014icp194-198.html

https://www.sciencedirect.com/science/article/pii/S0261517720302028

https://www.sciencedirect.com/science/article/pii/S0959652621031206 (analysis of survey data)

https://arxiv.org/abs/2206.15475 (causal machine learning)


Topic 15: Causal machine learning/inference to identify the causality of factors driving people shopping online of products having social -cultural values

Background
eCommerce customers intend to purchase products not only because of price, quality, and environment-friendliness but also because of the social-cultural values of the products. For example, a customer wants to purchase products that can contribute to the operation of an orphanage or products that are made by a rural poor community from a certain country with a particular culture. However, since the products are purchased online, it is difficult for the customers to experience the social-cultural values of the products directly. This thesis focuses on empirical research on how social-cultural values and perceptions influence purchasing behaviors of eCommerce customers. The thesis should apply causal machine learning which is a novel approach in AI.

Research Questions

  • What social-cultural factors of products contribute to people purchasing online?
  • How are the causal relationships among those factors based on collected dataset?
  • How to measure social-cultural value of an eCommerce product?

Tasks

  • Identify potential social-cultural factors on products that drive the intentions of people to purchase online through literature review
  • Develop hypotheses by correlating among the identified factors. The correlations can be positive or negative
  • Design a survey to test the hypotheses
  • Collect the data through a survey
  • Causal machine learning to analyze the collected data using tools such as DoWhy, ShowWhy
  • Development of social-cultural score based on identified factors
  • Description and discussion of the results

References

https://www.sciencedirect.com/science/article/pii/S0959652621012129

http://www.journals.researchub.org/index.php/JMAS/article/view/351

https://www.sciencedirect.com/science/article/pii/S0959652621031206 (analysis of survey data)

https://arxiv.org/abs/2206.15475 (causal machine learning)


Topic 16: The Role of Causal Machine Learning (CML) to Improve Decision Making in Supply Chain Management

Background

CML is an emerging and growing field in applying AI in business. Causal inference in machine learning is a statistical tool that enables AI and machine learning algorithms to reason based on cause and effect. Understanding cause and effect would make existing AI systems more intelligent and more efficient. Moreover, it can help organizations understand their data. Some possible scenarios that may apply CML [1]:
#1: In an e-commerce context, we could determine which specific factor impacts the decision to purchase a product most. We could better allocate resources to improve a particular KPI with this information. We could also rank the impact of different factors on the purchasing decision. We could determine if a given customer would have purchased a specific product if he/she had not bought other products for the last two years.
#2: In a broader sense, we could discover how and what a given business strategy could have avoided negative impacts? We could also determine how much we expect our sales to increase by implementing a specific training program to our business developers. the impact of a specific training program
#3: In the agricultural field, we often try to predict if a farmer’s crop yield will be lower this year. However, using casual inference, it will become to better understand what steps should we take to increase the harvest.

Research Questions

  • What scenarios in supply chain management may apply CML?
  • Which CML approach and tool is the most appropriate to improve supply chain processes?
  • How to measure the most appropriate approach?
  • What improvements in terms of decision making can be achieved?

Tasks

  • Define causality scenarios in supply chain contexts
  • Find an appropriate dataset
  • Literature review on related work, different approaches of causal inference (Uplift Random Forests, Meta Learners, SEM, Bayesian Network, etc.), and tools (CausalML [3], CausalNex [4])
  • Data preprocessing and exploratory analysis
  • Model implementation using chosen tools
  • Evaluation of the results
  • Discussions and documentation

Starting Literature
[1] https://towardsdatascience.com/introduction-to-causality-in-machine-learning-4cee9467f06f
[2] Dataset examples: https://github.com/rguo12/awesome-causality-data
[3] https://causalml.readthedocs.io/en/latest/about.html
[4] https://causalnex.readthedocs.io/en/latest/01_introduction/01_introduction.html#
[5] https://innovationatwork.ieee.org/what-is-causal-inference-and-why-is-it-key-to-machine-learning/
[6] https://www.sciencedirect.com/science/article/pii/S0278431921000578

 

 


Topic 20: Ontology Alignment using Multiple Similarity Measures

Background

Ontologies are convenient way for knowledge representation. When ontologies are involved in system integration, it requires communication between two or more ontologies. Ontology alignment (a process to generate a set of correspondences between similar entities of various ontologies) can be described as follows: given two ontologies, each describing a set of discrete entities (which can be classes, properties, etc.), find the similarities that hold between these entities. For automatic ontology matching, we need to combine some kinds of different similarity measures: lexical-based, Language-based, structure-based, and semantic-based techniques as well as using information in ontologies including names, labels, comments, relations and positions of concepts in the hierarchy and integrating WordNet dictionary.

Research Questions

  • Which similarity measures approaches are more suitable?
  • Which features are needed to extract from the ontologies for implementing the similarities?
  • Which libraries are more suitable for this task?
  • Can you develop the presented similarities to reach to the better F-measure?
  • Which approaches is more flexible to compute the similarities for large ontologies?

Tasks

  • Studying and comparing different similarity measures approaches
  • Finding and extracting suitable features from ontologies
  • Implementing different similarity measures (lexical-based, Language-based, structure-based, and
    semantic-based techniques)
  • Combining different similarity measures techniques as a final similarity

Programming Language: Python
Library: owlready2, BeautifulSoup, nltk

Starting References
[1] Nguyen, Thi Thuy Anh, and Stefan Conrad. “Ontology matching using multiple similarity measures.” 2015 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K). Vol. 1. IEEE, 2015.
[2] Nguyen, Thi Thuy Anh, and Stefan Conrad. “Combination of lexical and structure-based similarity measures to match ontologies automatically.” International Joint Conference on Knowledge Discovery, Knowledge Engineering, and Knowledge Management. Springer, Berlin, Heidelberg, 2012.
[3] Bulygin, Lev, and Sergey A. Stupnikov. “Applying of Machine Learning Techniques to Combine Stringbased, Language-based and Structure-based Similarity Measures for Ontology Matching.” DAMDID/RCDL. 2019.

Contact:

Tina Boroukhian
t.boroukhian@jacobs-university.de


Topic 21: Mapping Data (JSON, CSV) to Existing Ontologies and populate the ontologies by the Data contents using Python Libraries

Background

Ontologies are convenient way for knowledge representation. When ontologies are involved in system integration, it requires communication between two or more ontologies. Database-to-Ontology Mapping is the process whereby a database and an ontology are semantically related at a conceptual level, i.e. correspondences are established between the database components and the ontology components. For automatic mapping data to ontologies, we need to find the related components between ontology and our data and applying the mapping between them.

Research Questions

  • Which ontologies are related to our data? What are the ontology domains for our data?
  • Which libraries are more suitable for this task?
  • Which approaches is more flexible to map the big data to ontologies?
  • How can you improve the program for running the big amount of data?

Tasks

  • Studying and comparing different python libraries for purpose ontology
  • Finding the related components between ontology and data
  • Reading JSON file from the Api and mapping it to related ontologies
  • Populating the ontologies by our data contents
  • Saving the mapping file into the Neo4j

Programming Language: Python
Library: owlready2, RDFLib

Starting References
[1] Munir K., Anjum M. S., (2018) The use of ontologies for effective knowledge modelling and information
retrieval, Applied Computing and Informatics , 14(2), 116-126.
[2] Shah, T., Rabhi, F. & Ray, P. (2015) Investigating an ontology-based approach for Big Data analysis of
inter-dependent medical and oral health conditions. Cluster Compute 18, 351–367 (2015). https://doi.org/
10.1007/s10586-014-0406-8.
[3] https://owlready2.readthedocs.io/en/latest/onto.html.
[4] https://neo4j.com/labs/neosemantics/4.0/introduction/

Contact:

Tina Boroukhian
t.boroukhian@jacobs-university.de


Topic 22: Ontology Alignment using Machine Learning and Neural Networks Techniques

Background

Ontologies are convenient way for knowledge representation. When ontologies are involved in system integration, it requires communication between two or more ontologies. Ontology alignment is a process of establishing correspondences between semantically related entities in different ontologies. Applying machine learning between ontology elements as features for ontology matching. Logistic Regression, Random Forest classifier and Gradient Boosting are used as machine learning methods.

Research Questions

  • Which Machine Learning and neural networks Techniques are more suitable?
  • Which features are needed to alignment the ontologies?
  • Which libraries are more suitable for this task?
  • Can you develop the presented similarities to reach to the better F-measure?
  • Which approaches is more flexible to compute the alignment for large ontologies?

Tasks

  • Studying and comparing different machine learning and neural network techniques
  • Creating predicted alignment from two lists of entities
  • Applying machine learning methods to existing similarity measures and creating dataset and training a machine learning model

Programming Language: Python
Library: scikit-learn, owlready2, BeautifulSoup

Starting References

[1] Bulygin, Lev, and Sergey A. Stupnikov. “Applying of Machine Learning Techniques to Combine Stringbased,
Language-based and Structure-based Similarity Measures for Ontology Matching.” DAMDID/RCDL.
2019.
[2] Nezhadi, Azadeh Haratian, Bita Shadgar, and Alireza Osareh. “Ontology alignment using machine
learning techniques.” International Journal of Computer Science & Information Technology 3.2 (2011): 139.
[3] Bento, Alexandre, Amal Zouaq, and Michel Gagnon. “Ontology matching using convolutional neural
networks.” Proceedings of the 12th language resources and evaluation conference. 2020.

Contact:

Atit Bashyal
a.bashyal@jacobs-university.de


Topic 23: Querying Ontologies using NLP: A Natural Language based Query Interface to OWL Ontologies for Demand-Response System

Background

Ontologies are convenient way for knowledge representation. Accessing structured data in the form of ontologies requires training and learning formal query languages (e.g., Cypher or SPARQL) which poses significant difficulties for non-expert users. One of the ways to lower the learning overhead and make ontology queries more straightforward is through a Natural Language Interface In this project we need to Implement the Question-based Interface to Ontologies – a tool for querying ontologies.

Research Questions

  • How we can extracting domain knowledge automatically?
  • How we can translating the text query to formal query language, e.g. Cypher or SPARQL?
  • Which libraries are more suitable for this task?

Tasks

  • Extracting domain knowledge automatically
  • Extracting a text query as an input and transforms it to the set of queries expressed in a formal language, e.g. Cypher or SPARQL
  • Connecting to Neo4j and querying ontology based on formal language, e.g. Cypher or SPARQL

Programming language and libraries

  • Python
  • nltk, owlready2

Starting References

[1] Damljanovic, Danica, Valentin Tablan, and Kalina Bontcheva. “A Text-based Query Interface to OWL Ontologies.” LREC. 2008.
[2] Bernstein, Abraham, et al. “Querying ontologies: A controlled english interface for end-users.” International Semantic Web Conference. Springer, Berlin, Heidelberg, 2005.

Contact:

Tina Boroukhian
t.boroukhian@jacobs-university.de


Topic 24: How artificial intelligence can improve the management of large projects? A causal machine learning approach

Background

Even though the impact of artificial intelligence, predictive analytics and machine learning has proved to be significant in different areas of automation and engineering, there is a lack of research on their effect on project management.

Research Questions

  • How artificial intelligence can improve the management of large projects?
  • What aspects of project management can be improved through AI?
  • Which factors influence the willingness of projects/project managers to adopt AI?
  • How are the causal relationships among those factors based on collected dataset?

Tasks

  • Systematic and comparative literature review on the applications of AI and machine learning in project management
  • Identification of aspects/factors in project management that can be improved through AI.
  • Identification of factors influencing the willingness of project managers/teams to adopt AI
  • Development of causal graph based on the identified factors
  • Survey 
  • Causal machine learning to analyze the collected data using tools such as DoWhy, ShowWhy

Starting References

https://medium.com/the-project-office/artificial-intelligence-in-project-management-68ddf2ad91d7

https://www.liquidplanner.com/blog/seven-future-trends-in-project-management/


Topic 25: Managing interoperability in circular economy

Background

The circular economy concept involves collaborations of different human actors and information systems throughout the supply chain. Those different actors and systems need to share reliable data and innovative solutions with each other to improve decision makings and process efficiency. In other domains like healthcare [1], and inter-organization collaboration, ontologies and standards are used to define common understanding on a semantical level. Those ontologies and standards provide agreed vocabularies that represent entities, processes, and resources in the whole system

Research Questions

  • What are typical actors, systems, and solutions involved in the circular economy?
  • What vocabularies and relation models are required to facilitate the interoperability of those entities?

Tasks 

  • Systematic and comparative literature review
  • Identification of actors, systems, and exchanged information and solutions between actors/systems
  • Identification of common vocabularies to facilitate interoperability
  • Development of relation model representing the vocabularies and their relations

Starting References 

[1] https://www.sciencedirect.com/science/article/pii/S1877050912004024

[2] https://link.springer.com/chapter/10.1007/978-3-642-15961-9_86

[3] http://ceur-ws.org/Vol-2044/paper10/paper10.html

 


Topic 26: The impacts of dynamic electricity price on production planning and control

Background

The introduction of dynamic electricity prices in the manufacturing (energy consumer) requires the production planners to consider the price fluctuation in their production planning and controls. The production processes have to be shifted to the time points where the electricity price is low.

Research questions

  • What activities in production planning and control will be influenced by the dynamic electricity price?
  • What objectives, variables, and constraints are required to be changed compared to conventional production planning and control?
  • How can production planning and control be optimized?

Tasks 

  • Systematic and comparative literature review
  • Analysis of different dynamic price models and their impacts on the planning time horizon
  • Identification objectives, variables, and constraints for the optimization
  • Development of optimization model
  • Solving the optimization model using tools, eg. using linear programming, ant colony optimization, etc.

Starting References

https://www.sciencedirect.com/science/article/pii/S2212827116001281

https://publikationen.bibliothek.kit.edu/1000060882


Topic 27: Energy consumption and cost prediction of customized products using data analytics or machine learning

Background

To improve the competitiveness in the market, companies transform their business by providing customized products. A stainless steel manufacturing SME, who is currently the leader in a niche market of stainless steel application in the oil and gas industry, provides customized steel products that allow customers to configure the material, size, shape, heat treatment, etc. However, it leads to challenges in estimating the price and energy consumption of the customer configured products individually.

Research Questions

  • What variables influence the energy consumption and costs?
  • Which data analytics/machine learning methods can be applied to estimate the energy consumption and costs? Which method is the best?
  • What would be the actions to save the energy consumption and costs after predicting them?

Tasks

  • Comparative literature review
  • Data collection, pre-processing, exploratory data analytics
  • Building machine learning models
  • Evaluation of the models
  • Critical analysis of results and actions required to achieve energy and cost-saving

Starting References

https://publikationen.bibliothek.kit.edu/1000060882

 


Topic 28: How does COVID-19 give impact to the supply chain of different products (e.g. hygiene, electronics/IT, typical product )? A data analytics approach

Background

The global COVID-19 pandemic has a strong impact to world economy due to the limitation of supply chain activities. The supply chain of certain types of products are negatively affected but not for some other types of product. This thesis focuses on the analysis of the supply chain of different types of products during the global pandemic by analyzing the available data.

Research Questions

  • What types of products are strongly affected by the COVID-19 global pandemic?
  • How the pandemic affects the supply chain of those products?
  • What model can be used to describe the correlations of the pandemic to the supply chain. (note: focus on the different time series models e.g. Holt-Winters, ARIMA, SARIMA, VAR, etc. or regression models, e.g. linear, polynomial, random forest, ridge, etc.)

Tasks

  • Comparative literature review to analyze related works
  • Identification of supply chain scenarios affected by COVID-19 pandemic
  • Data collection
  • Exploratory data analysis
  • Building the time series and regression models
  • Evaluation of the results

Starting References

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7413852/

 


Topic 29: The roles of digital twins in sustainable supply chain

A digital twin can be used as a virtual supply chain replica that consists of hundreds of assets, warehouses, logistics and inventory positions [1]. It is gaining more attention in the industry due to improvements in technical and computational capabilities with operations technology.

Research Questions

  • What factors make a supply chain more sustainable?
  • What are the requirements for applying digital twins to a supply chain?
  • What scenarios in a supply chain can be improved through digital twins?

Tasks

  • Comparative literature review on sustainable supply chains and digital twins
  • Identification of essential factors in applying digital twins in supply chains
  • Development of general sustainable supply chain scenarios
  • Two possible methodology
    • Cross case analysis: combining literature studies and interview on different supply chain scenarios
    • Surveys
  • Analysis of literature studies + interview or survey results

Starting References

[1] https://www.supplychaindigital.com/technology/evolution-digital-twins-supply-chain

[2] https://link.springer.com/article/10.1007/s11036-020-01557-9


 

Student Theses

NameTopicYearThesis Category
Edmundo CuadraData Analytics Approach for Penetrating the Aircraft Manufacturing Market Duopoly2021B.Sc. IEM
Egor SarafanovArtificial Intelligence in Project Management: What factors influence the level of AI adaptation?2021B.Sc. IEM
Ishansh GuptaEnhancing Football Analytics using Data Science2021M.Sc. DE
Korin HoxhaTowards the Construction of a Supply Chain Management Digital Twin using Data Management and Data Analytics2021B.Sc. IEM
Mohid QaiserThe effect of dynamic electricity price on production planning and control2021B.Sc. IEM
Oumeng XiaHow COVID-19 Give Impact to the Supply Chain of Food and Hygiene2021B.Sc. IEM
Pardis SahraeiIntegration of Drones Technology and the Public Transportation System for the Last Mile Delivery by Considering Dynamic Distribution Centers2021M.Sc. IEM
Sebastian GonzalezImpact of Machine Learning and Natural Language Processing on Products and their Features2021B.Sc. IEM
Tammy SiyakurimaHow can Artificial Intelligence Improve the Management of Large Projects? 2021B.Sc. IEM
Temirlan AikenovEnergy consumption and cost prediction of customized products using data analytics or machine learning2021B.Sc. IEM
Thanh Tuan NguyenHandling interoperability in dynamic electricity market2021B.Sc. IEM
Unai AlapontNovel Smart Contracts Model for the Infrastructure Sector: A Design Science Research Framework2021B.Sc. IEM
Vaibhav AprajA Proof of Concept for an incrementally enhanced Data Quality Dashboard2021M.Sc. DE
Yanny BudiakiReduction of food loss along the Sub-Saharan African food supply chain: The application of a sharing economy product2021B.Sc. IEM
Andrea Pin MoralesThe Environmental Impact of Increased Transparency In Last Mile Delivery Emissions in E commerce A study on the effect of a sustainability driven point system on customer behavior2020B.Sc. IEM
Atit BashyalMoving Beyond Maximum Likelihood Estimates Application of Bayesian Inference techniques in Poverty Prediction.2020M.Sc. DE
Daniyar AbdimomunovOpportunities and Challenges of Data-driven Applications Systematic comparison and validation of academic literature from the perspective of data users2020B.Sc. IEM
Daoyuan JiCollaborative Distribution Model between Logistics Service Providers and Public Transportation Operators: Investigation of parcel Movement in the Public Transit Network with Heuristics Approach2020M.Sc. SCEM
Helen SchmitzHow does bonus programs affect the customer behaviour towards sustainable packaging?
A survey-based customer behaviour analysis
2020B.Sc. IEM
Kataryna TaranCurrent state of Facilities Management industry: Development and implementation of new standards, technologies, and strategies around the world2020M.Sc. SCEM
Merint Thomas MathewMonitoring Customer Activity and Forecasting Future Deals for improved Customer Success Management2020M.Sc. DE
Nathnael TayeEnergy efficiency and Digitalization: How to improve awareness on energy efficiency in residential buildings using digitalization?2020B.Sc. IEM
Paola CevallosBest Practices of Digitalization within Motorsport Events2020B.Sc. IEM
Paolo de LeonHow Industry 4.0 Secures the Future of the Shipbuilding Industry Data Analysis on the implementation of new IoT technologies to the Shipbuilding Industry2020B.Sc. IEM
Peter-Sleiman MansourSupplier selection model for sustainable supply chain in low-cost robotics production2020M.Sc. SCEM
Saurav KediaCarpooling in Nepal: User Prospective based on Sustainability:
A survey-based approach
2020B.Sc. IEM
Sebastian ReiserDynamic Electricity Pricing Comparative Study of different forecasting methods 2020B.Sc. IEM
Surendra KetkarQuality Control in Smart Manufacturing for SMEs2020M.Sc. SCEM
Yousuf FarooqCan Claiming Sustainability harm companies?
A survey-based approach
2020B.Sc. IEM
Zayed BaloutIndustry 4.0:
An Empirical and Conceptual Analysis Approach to the Implementation and Progression of Digitalization within the Construction Sector
2020M.Sc. SCEM
Amin HoushidariBig data analytics for forecasting in supply chain2019M.Sc. SCEM
Burulai Abdykapar KyzyThe role of artificial intelligence, machine learning and predictive analytics in project management2019M.Sc. SCEM
Cedric OhlmsData and Interface Management for optimization of change processes 2019B.Sc. IEM
Dazhi ZhanSystematic Comparison of Impact and Readiness of Industry 4.0 Technologies between China and Germany2019B.Sc. IEM
Diego LainfiestaCapacity and Risk Planning in a high Mix Product Environment
2019M.Sc. DE
Katharina LandersThe Use of Artificial Intelligence and Internet of Things for Anomaly Detection and Predictive Maintenance for Construction Machinery2019M.Sc. SCEM
Mifrah AftabAnalyzation and Development of Logistics Chain KPIs for Sustainable Ports in Europe.2019M.Sc. SCEM
Milos DelikladicHow Industry 4.0 technologies can be incorporated in future farming? Survey and interview investigation in Serbia 2019B.Sc. IEM
Miruthula Ramesh To what extent can sustainable/green manufacturing benefit from the integration of IoT and Big data?
2019B.Sc. IEM
Murtaza KhawajaThe Role of After Sales in an IoT based Startup 2019B.Sc. IEM
Osama Bin WaheedConstruction 4.0 and smart building Development of the reference process model in construction 4.0 Case study: Sweden and China2019B.Sc. IEM
Rahul UpadhyayIndustry 4.0 in Farming: Precision Agriculture
A case study in Nepal
2019B.Sc. IEM
Sharath Abraham PeterApplication of augmented reality technology in the manufacturing process of pressing tools2019M.Sc. SCEM
Tianran NiAutomated Information System Development for Medium- to Short-term Capacity Planning in MTO Industry2019B.Sc. IEM
Xuqi BaiErgonomic Performance Indicator Development for Assembly Systems: Extension of EAWS Method with Assessment of Environmental Factor2019B.Sc. IEM
Ali NaweedThe Role of Digitalization for transformation to Industrie 4.0 oriented Business Models
in the German Automotive Industry.
2018B.Sc. IEM
Anton Shavtvalishvili Additive Manufacturing’s impact on supply chain
in aerospace industry
2018B.Sc. IEM
Hussein HegazyOutsourcing and Swap practice in the petroleum industry2018B.Sc. IEM
JongHack YiDevelopment of Sales Demand Forecast through process Improvement2018M.Sc. SCEM
Kemal TumyschOutsourcing and Enterprise Resource Planning: Challenges and Opportunity in Petroleum Supply Chain2018B.Sc. IEM
Kirsten MuellerAnalysis of NBA supply chain network for its expansion to Europe 2018M.Sc. SCEM
Osaid KhanAssessing the Maturity Level of Industry 4.0 in the Automotive Industry2018B.Sc. IEM
Sergi Drago GonzalezDo machine learning models provide accurate macroeconomic GDP forecast?2018B.Sc. GEM
Soman TariqReinventing the Manufacturing Sector of Aerospace Industry by Implementing Industry
4.0 Technologies
2018B.Sc. IEM
Waleed AsifImproving Energy and Resource Efficiency in Different Level of Production Using Industry 4.0 Technologies2018B.Sc. IEM
Young Joon LeeProduct Lifecycle Information Model for Reference Architectural Model Industrie 4.0 (RAMI 4.0)2018M.Sc. SCEM
Akarawint ChawalitanontPower Consumption Prediction of Customised Stainless Steel Ring Products using Hierarchical Bayesian Models and Artificial Neural Networks2022M.Sc. DE
Nadhir Khoubeieb MechaiCausal Machine Learning (CML) for Demand Forecasting of Semiconductor Products2022M.Sc. SCM
Xiaotong FanAdvanced Time Series Analysis for Electric Vehicle Demand Forecasting2022M.Sc. SCM
Seyed Taha RaeisiDelivery Assurance and Digitalization Utilizing Expert’s Knowledge, Internal Key Performance Indexes and Exogenous Risk Factors to Drive Operational Supply Chain2022M.Sc. SCM
Vamsi Sai PidikitiManaging interoperability in the circular economy2022M.Sc. SCM
Muhammad Abdullah ShahUsing Machine Learning to Optimize Dynamic Pricing in Production at Machine Level2022B.Sc. IEM
Nurgun Rafizade, Exploration of Radial Basis Function Networks and Adaptive Network Based Fuzzy Inference Systems for Time Series Forecasting2022B.Sc. CS
Hamza HayakExplainable AI for Crop Yield Prediction2022B.Sc. CS
Toska YmerhaliliDeep Reinforcement Learning for Industrial Microgrid Management2022B.Sc. CS
Nguyen Duy BaoEnsemble Learning to forecast Wind/Solar power generation2022B.Sc. IEM
Carlotta BuckIncentivising Voluntary Carbon Offsetting and its Driving Factors: Integrating Causal and Statistical Data Analysis2022B.Sc. IEM
Abel TeklearegayOntology Matching Using Supervised Learning Techniques2022B.Sc. IEM
Christopher MclaughlanQuerying Ontologies using NLP: A Natural Language based Query Interface to OWL Ontologies for Demand-Response System2022B.Sc. CS
Kaleb KristoWhat social-cultural values of products make people shop online? A causal data analytics method2022B.Sc. IEM
Moonkyoung ChooEffect of Governmental Policies and Healthcare Systems on COVID-19 Hosptalization: A Machine Learning Prediction Approach2022B.Sc. IEM
Karen OkoliThe Role of AI in Circular Economy: Automotive industry2022B.Sc. IEM
Kezia AdesanyaMapping JSON data to Existing Ontologies: Populating the ontologies by the Data contents using Python Libraries2022B.Sc. IEM
Mariam BukhrashviliThe impact of Covid-19 on Quality Audits The Future of pharmaceutical quality audits2022B.Sc. IEM
Mohamed GhandourThesis 2022
21. Mohamed Ghandour, Product Portfolio Complexity Management a data-driven approach using ‘Analytical Hierarchy Process’ and the ‘Technique for Order Performance by Similarity to Ideal Solution
2022B.Sc. IEM
Mulisa Kevine UmwizaThe Role of Digital Twin in Sustainable Supply Chain in the Automotive Industry2022B.Sc. IEM
Rauan BeibitCOVID-19 Impact on the Supply Chain Effects of the Semiconductor Shortage on the Automotive Industry2022B.Sc. IEM
Stefan KrstevskiThe Influence of Dynamic Electricity Price on Production Scheduling: A Mixed Integer Linear Programming Model2022B.Sc. IEM
Ylva ReinholdHow Do Industry 4.0 Applications Help in Designing Sustainable Forest Management? A Conceptual Framework of IoT in Novel Sectors2022B.Sc. IEM
Xuchong DuApplication of GNN in supply chain for link prediction and explainability analysis2022B.Sc. CS
Apivut OurairatGap Analysis, State of the Art, Existing Artificial Intelligence
in Agrifood supply chain
2022B.Sc. IEM
Abumansur SabyrrakhimMapping SQL (Pre)-clinical Data
from Cancer Radiooncology and
Radiobiology Studies to Ontology
2022B.Sc. CS
Dhushyanth RavindranSystematic Literature Review and Conceptual Model - Artificial Intelligence in Agri-based Food Supply Chain2023M.Sc. SCM
Mehr Un NisaConceptual model of platform for digital asset management to enable cross-company collaborative digital twin development for transport infrastructure management2023M.Sc. SCM
Vikas KumarLogistic KPI Analysis for Wind Turbine Industries2023M.Sc. SCM
Adriana MartinezDecision-Targeted Operational Supply Chain: A Comparative Study of Machine Learning Approaches for Supplier
Escalation in the Operational Supply Chain Using Multi-Criteria
Decision-Making (MCDM)
2023M.Sc. DSSB
Siyakurima Tammy TacheneswaIdentifying Essential Driving Factors of Digital Transformation Maturity Models Using Fuzzy MCDM Methods:
A Company Based Survey Approach
2023M.Sc. DSSB
Aruzhan NurzhuminaTowards
Ontology based Model for Circular
E conomy S cenarios in Automotive Industry
A Systematic Literature Review Approach
2023B.Sc. IEM
Assiya ZakayCross Company Digital Twin Development in Circular Economy: Conceptual Model for a Digital Asset Management Platform2023B.Sc. IEM
Heinn Thant ZawPredicting Stock Prices and Trend Using
Machine Learning
Data analytics approach using Random Forrest and
Long Short Term Memory
2023B.Sc. IEM
Jiyuan WeiWhy people are willing to work from home in post-COVID-19 era?
A survey and causal analysis approach
2023B.Sc. IEM
John AtanacioExplainable Artificial Intelligence in Crop Yield Prediction
Analyzing the impact of climate change in German crops
2023B.Sc. IEM