Informatik

https://kobra.uni-kassel.de:443/handle/123456789/2006051211619 2024-09-11T10:51:26Z

Ad-hoc Komposition optimaler Verarbeitungsketten für die Informationsextraktion aus heterogenen Produktpreisblättern (Ad hoc composition of optimal processing chains for information extraction from heterogeneous product price sheets)
https://kobra.uni-kassel.de:443/handle/123456789/15917
Market transparency stabilizes competition in an industry, promotes innovation, and protects consumers from high costs. Creating this transparency requires independent companies that collect information about the various suppliers and products in the market in question. In many industries, suppliers publish their product information in individual product price sheets (Produktpreisblätter, PPB), which they make available on their websites as non-machine-readable PDF documents. This results in time-consuming and expensive information extraction (IE) processes at the independent companies. Technically, automating IE from non-machine-readable PDF documents is very complex. It requires solving various subtasks, such as table recognition and semantic analysis of text. Accordingly, different solution components must be developed and composed into coherent processing chains. The optimal processing chain for an unknown input document depends on its format. Frequent changes to the product price sheets and continuous technical progress create a highly dynamic problem environment that requires flexible, extensible, and adaptable processing chains. This thesis presents a framework that enables the implementation of self-adaptive IE systems and thus supports companies in automating, step by step, their manual processes for capturing relevant information. The framework enables the composition of flexible processing chains from exchangeable solution components that different specialists can implement. Furthermore, it automatically determines optimal processing chains for diverse document formats. The implementation is based on a distributed microservice architecture (MSA), which keeps the overall system continuously adaptable and extensible. The individual solution components are implemented as self-contained microservices, so their developers can use the programming languages and libraries best suited to each problem. Splitting the extraction of domain data objects into the extraction of disjoint pieces of information supports step-by-step automation of the underlying business processes. An optional review of the extraction results by domain experts additionally ensures compliance with corporate quality requirements. The framework was evaluated through its successful use at a cooperation partner that collects information on basic electricity supply tariffs in Germany.
It was shown that the framework supports the step-by-step automation of IE through the continuous integration of new solution components. The manual effort for capturing unit prices and base prices (Arbeits- und Grundpreise) was reduced by 60%. Furthermore, 7% of all documents could be processed fully automatically.
2024-01-01T00:00:00Z Jentgens, Michael

Object Detection for Automotive Radar Perception
https://kobra.uni-kassel.de:443/handle/123456789/15858
Automated vehicles are among the biggest trends in the automotive industry. The desired level of automation is slowly progressing from advanced driver assistance functions to fully autonomous driving. Excellent environmental perception is a critical requirement in this development. This thesis focuses on solutions to the challenges that come with using automotive radar systems for road user recognition. To this end, several machine learning techniques are applied and compared for detecting and classifying moving road users in automotive radar point clouds. An overview of radar processing explains how to use and interpret the data properly. All methods are evaluated on publicly available real-world data sets. To facilitate the creation of such data sets, a system for automating the associated labeling process is introduced. The detection and classification concepts start with classical modularized approaches that use a clustering algorithm, followed by a feature extraction stage and a conventional classifier. Several techniques that improve these traditional methods are proposed and evaluated, e.g., recurrent neural network ensembles and advanced multi-stage clustering. Then, a transition is made from modularized concepts to more self-contained models enabled by modern end-to-end deep learning methods, which combine the localization, feature extraction, and classification stages in a single model that can be optimized in one step.
The developed methods are applied in two case studies, which show how automotive radar can detect non-line-of-sight objects around corners and how next-generation radar sensors affect the accuracy of radar detection systems.
2024-01-01T00:00:00Z Scheiner, Nicolas Simon

Algorithms for Emotion Recognition
https://kobra.uni-kassel.de:443/handle/123456789/15856
Technological advancements have increasingly facilitated emotion recognition through physiological sensors integrated into intelligent devices, such as earables and wristbands that can measure the pulse. People commonly wear these devices in their everyday lives (i.e., in the wild). Patterns can be extracted from various physiological signals, enabling the recognition of emotions. This capability can be integrated into diverse applications (e.g., into the machine learning process), such as attention management systems, human-robot interaction, and stress detection, enabling them to take human emotions into account and support people empathically. This thesis addresses the algorithmic design, adaptation, and use of emotion recognition with physiological signals for these applications. It gives a literature overview of the methodology for recognizing emotions outside the laboratory from physiological sensor data in the context of the three applications and, for each one, summarizes the role of emotion recognition and the existing challenges associated with it. Three key research gaps are identified and addressed: 1) the acquisition of accurate physiological data for emotion recognition in the wild, 2) the improvement of the emotion recognition process, and 3) the need for ethical standards to accompany emotion recognition with physiological sensors in the wild. In addressing the improvement of the emotion recognition process, the thesis focuses on mitigating the impact of physical activity on physiological sensor data measured outside the laboratory. Classification algorithms are trained on new mathematical features with data affected by physical activity to create enhanced emotion recognition models, achieving a classification accuracy of up to 73%. Finally, it is demonstrated that emotion recognition can enrich applications like attention management systems by finding opportune moments for interruptions based on smartphone notification response time prediction.
2023-01-01T00:00:00Z Heinisch, Judith Simone

Continuous Feature Networks: A Novel Method to Process Irregularly and Inconsistently Sampled Data With Position-Dependent Features
https://kobra.uni-kassel.de:443/handle/123456789/15738
Continuous kernels are a recent development in convolutional neural networks. Such kernels are used to process data sampled at different resolutions as well as irregularly and inconsistently sampled data. Convolutional neural networks have the property of translational invariance (i.e., features are detected regardless of their position in the measurement domain), which is unsuitable if the position of detected features is relevant for the prediction task. However, the ability of continuous kernels to process irregularly sampled data is still desired. This article introduces the continuous feature network, a novel method that uses continuous kernels to detect global features at absolute positions in the data domain. Through a use case in processing multiple spatially resolved reflection spectroscopy data, which is sampled irregularly and inconsistently, we show that the proposed method can process such data directly, without the additional preprocessing or augmentation that comparable methods need. In addition, we show that the proposed method achieves a higher prediction accuracy than a comparable network on a dataset with position-dependent features. Furthermore, it is more robust to missing data than a benchmark network using data interpolation, which allows the network to adapt to sensors in which individual light emitters or detectors have failed, without retraining. The article shows how these capabilities stem from the continuous kernels used and how the number of available kernels to be trained affects the model.
Finally, the article proposes a way to use the introduced method as the basis for an interpretable model suitable for explainable AI.
2023-12-30T00:00:00Z Magnussen, Birk Martin; Stern, Claudius; Sick, Bernhard