Thesis:

The approach integrates:
Clustering Algorithms: HDBSCAN for identifying patterns in unstructured recipe data.
Interactive Machine Learning: Semi-supervised training with minimal human intervention.
Transformer Models: Fine-tuned BERT and T5 for question generation, answering, and machine reading comprehension.
Linguistic Analysis: Part-of-speech (POS) tagging to improve system precision.
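
One way the clustering and interactive-learning components could fit together is a cluster-then-label step: an annotator labels a few items, and each label is spread to the other members of the same cluster. The sketch below is an illustration under that assumption, not the thesis's actual procedure. The function name propagate_labels, the example labels, and the hard-coded cluster ids are all hypothetical; the only convention borrowed from HDBSCAN is that unclustered (noise) points carry the id -1.

```python
from collections import Counter, defaultdict

NOISE = -1  # HDBSCAN-style id for points assigned to no cluster


def propagate_labels(cluster_ids, human_labels):
    """Spread a few human labels to every member of the same cluster.

    cluster_ids  : list of int, one cluster id per item (NOISE = unclustered)
    human_labels : dict mapping item index -> label supplied by an annotator
    Returns a list with one label (or None) per item.
    """
    # Collect the annotator-provided labels seen in each cluster.
    votes = defaultdict(Counter)
    for idx, label in human_labels.items():
        cid = cluster_ids[idx]
        if cid != NOISE:
            votes[cid][label] += 1

    # Pick the most frequent human label in each cluster.
    cluster_label = {cid: c.most_common(1)[0][0] for cid, c in votes.items()}

    result = []
    for idx, cid in enumerate(cluster_ids):
        if idx in human_labels:
            # Always keep the annotator's own label.
            result.append(human_labels[idx])
        else:
            # None for noise points and for clusters nobody has labeled yet.
            result.append(cluster_label.get(cid))
    return result


# Hypothetical example: 7 recipe snippets, 2 of them labeled by a human.
cluster_ids = [0, 0, 0, 1, 1, NOISE, 2]
human_labels = {0: "ingredient", 3: "instruction"}
labels = propagate_labels(cluster_ids, human_labels)
```

Items left as None (noise points and clusters without any labeled member) are the natural candidates to show the annotator next, which keeps the human effort small.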