Hello, I have a ton of questions posted by users in many discussions, let us assume users started many discussions about a specific topic for eg cakes
and each discussion has 2k posts - every post is a question
100 discussions x 1k posts = 100k posts
(this is just an example I am talking about huge data we can assume anywhere from 500k to 1000k questions (posts))
I can not answer all of these questions and I want to run some kind of query or analysis to find out which questions are more similar / most relevant and I want to group them automatically by a tag or something.
the questions could be asking about ingredients of cake or making process of cake or colours of cake .. there could be many unique questions about the cake. I am aware of the questions but I want all colours
related questions to be grouped in a way to answer them all at once
how can we achieve this? is it really possible?