New doctoral dissertation

June 12, 2024
VILNIUS TECH Library invites you to follow the published new dissertations. The dissertation „Internet Web Page Content Block Dataset and Solutions for its Data Labelling Simplification“ prepared by VILNIUS TECH, Kiril Griazev. The dissertation was prepared in 2017–2024. Scientific Consultant – Prof. Dr Simona Ramanauskaitė.

The dissertation will be defended at the public meeting of the Dissertation Defence Council of the Scientific Field of Informatics Engineering in the SRA-I Meeting Hall of Vilnius Gediminas Technical University at 14 a.m. on 12 June 2024.

The dissertation explores the intricacies of identifying, extracting, and documenting content blocks in internet web pages. The research object is the methodologies for these processes to improve the computer perception of online web page data. The primary goal is to conduct an in-depth analysis of datasets containing web page content blocks to enhance their granularity and minimise the volume of blocks requiring manual labelling. The dissertation undertakes several essential tasks: (1) conducting a systematic analysis of the latest research in the field of data extraction from internet web pages; (2) developing a structured dataset for web pages that accommodates a variety of features for different content blocks and is compatible with various data extraction methods; (3) creating a solution for partly automated content block labelling in web pages, which establishes relationships between content blocks and groups them, thereby reducing the need for manual review; (4) evaluating the effectiveness of this developed dataset and labelling solution in identifying, grouping, and establishing relationships between web page content blocks. The dissertation comprises four parts: an introduction, four main chapters, conclusions, references, and appendices. The introduction presents the research problem, significance, objectives, methodology, novelty, practical implications, defended statements, lists of the author’s conference presentations and outlines the dissertation’s structure. The first chapter focuses on Web Mining and examines the challenges and evolution of data extraction and classification techniques. The second chapter explores methods to determine HTML block similarity, considering data and structure. The third chapter details creating a dataset for improved data extraction, highlighting the need for diverse information about block types, features, and structures. The fourth chapter presents advanced methods for identifying HTML content blocks and enhancing content extraction accuracy and efficiency. Several articles were published on the topic discussed in the dissertation: two in publications of the main list of Clarivate Analytics Web of Science and two in the publications of scientific conference proceedings. Research results were presented at three international conferences: 6th Workshop on Advances in Information, Electronic and Electrical Engineering (AIEEE), 2018, Vilnius, Lithuania; Open Conference of Electrical, Electronic and Information Sciences (eStream), 2018, Vilnius, Lithuania; International Conference on Science & Technology (STRA), 2023, Prague, Czech Republic.
 
Doctoral dissertation readers can search via VILNIUS TECH Virtual Library.
 

Related news

New doctoral dissertation
New doctoral dissertation
VILNIUS TECH Library invites you to follow the published new dissertations. The dissertation „Research and application of machine learning methods for migraine attack prediction“ prepared at VILNIUS TECH by Viroslava Kapustynska. The dissertation was prepared in 2021–2026. Scientific consultant – Prof. Dr Šarūnas Paulikas. The dissertation was defended at the public meeting of the Dissertation Defense Council of the Scientific Field of Electrical and Electronic Engineering in the Aula Doctoralis Meeting Hall of Vilnius Gediminas Technical University at 2 p.m. on 9 June 2026. Migraine is a complex neurological disorder characterized by strong inter- and intra-individual variability, which makes early forecasting difficult using only clinical observations. Wearable biosensors combined with machine learning offer new opportunities to detect subtle physiological changes that may precede migraine attacks and to develop individualized prediction models. This dissertation investigates migraine analysis and next-day prediction using physiological recordings collected under real-life monitoring conditions. Data were obtained with the Empatica Embrace Plus wearable device and include electrodermal activity, pulse rate, skin temperature, and movement-related signals. The analysis focuses on nocturnal recordings, since the night period provides a more stable physiological context with fewer external disturbances. Nights were standardized using sleep-based contextual selection and consistent night-level rules. The experimental framework is organized in two stages. In the first stage, a window-level binary classification task is used as an exploratory methodological analysis to examine how design choices influence model performance. Night recordings are segmented into analysis frames ranging from 5 to 120 minutes, statistical features are extracted, and the influence of signal preprocessing and feature representation is evaluated across several classifier families, including Random Forest, XGBoost, histogram-based gradient boosting, support vector machines, and k-nearest neighbors. In the second stage, the research evaluates next-day migraine prediction based on whole-night recordings. This stage refines the experimental methodology to obtain more reliable estimates of predictive performance under a stricter validation framework. The analysis focuses on the effect of temporal aggregation while comparing the same classifier families under consistent evaluation conditions. The results demonstrate considerable variability across participants in achievable prediction performance and optimal modeling configurations. Shorter analysis frames generally preserve informative short-term physiological changes, whereas longer windows tend to smooth these variations. Signal preprocessing shows a window-dependent effect and does not consistently improve performance. Overall, the results highlight the importance of temporal resolution, rigorous validation, and individualized modeling for wearable-based migraine prediction systems. Doctoral dissertation readers can search via VILNIUS TECH Virtual Library.
More
Expert Evaluation: VILNIUS TECH’s Progress Exceeded Expectations
Expert Evaluation: VILNIUS TECH’s Progress Exceeded Expectations
VILNIUS TECH has received a highly positive assessment from international experts. In their recently published conclusions, it is noted that since the 2022 institutional evaluation, the university has achieved significant, evidence-based progress across all four evaluation areas: governance, quality assurance, studies and research activities, and impact on regional and national development. In 2022, VILNIUS TECH was granted a seven-year accreditation. At that time, the expert panel provided the university with 19 recommendations for further improvement. The latest progress review concludes that the university responded to these recommendations responsibly, systematically, and constructively, and that the implemented changes have become part of long-term institutional development. „We are pleased that external experts have highly evaluated the progress achieved by VILNIUS TECH across all four assessment areas. It was noted that the university demonstrates a mature quality culture, a strategic vision, and the ability to consistently sustain growth and increase its impact on society. This ensures that we are entering the next institutional evaluation period with a strong position,“ says Nora Skaburskienė, Director of the Studies Directorate. International experts particularly highlighted the consistently strengthened system of strategic management, the quality culture, active collaboration with business and alumni, leadership within the ATHENA European Universities Alliance, the development of new interdisciplinary study programmes, and significant progress in innovation and technology transfer. The rapid expansion of lifelong learning activities was also noted — VILNIUS TECH has broadened its micro-credential offerings, strengthened partnerships with social and business partners, and is creating favourable conditions for knowledge commercialization and startup development. According to the expert panel, the university has already moved beyond the stage of merely responding to recommendations and is now ready to purposefully leverage its accumulated potential to achieve even higher performance results. In summarizing the evaluation, the experts concluded that VILNIUS TECH is entering the next phase of institutional assessment with a solid foundation for continued successful development.
More