Think Different: Data Ingestion for Retrieval-Augmented Generation (RAG)