How do I find common keywords

saad · May 21, 2020, 8:53am

I have a list of product titles (rows) I want to analyze. Is there a way to find common keywords between these rows?

Here is an example
Row 1: iPhone XS Cover
Row 2: Cover for iPhone S
Row 3: Android Mobile Phone Cover

Is there a way I can find the most common keywords among all rows? In this case the result would be Cover - Count 3, iPhone - Count 2

brian · May 21, 2020, 4:42pm

Hey Saad!

How many rows do you have? If it's not too many, you can use a Column Split, set to split into new ROWS, and set the delimiter to a space.

That will make 1 row per word.

Then you can use the Count Values to count the column of words, and it should give you a count of each word present!

But if you have a lot of rows to start, it will quickly become too many rows.
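
For anyone who wants to check the logic outside Parabola, here is a minimal Python sketch of the same split-then-count idea. It is only an illustration, not how the Column Split or Count Values steps are implemented; the `titles` list and the `count_words` helper are made-up names using the example rows from the question.

```python
from collections import Counter

# Example rows taken from the question above.
titles = [
    "iPhone XS Cover",
    "Cover for iPhone S",
    "Android Mobile Phone Cover",
]

def count_words(rows):
    """Split every row on whitespace (one word per 'row') and count each word."""
    words = [word for row in rows for word in row.split()]
    return Counter(words)

word_counts = count_words(titles)

# Print the two most frequent words.
for word, count in word_counts.most_common(2):
    print(f"{word} - Count {count}")
# Cover - Count 3
# iPhone - Count 2
```

Note that the split is case-sensitive and exact, so "Phone" and "iPhone" are counted as different words.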

saad · May 21, 2020, 6:14pm

Brian, I just tried it and it works like it should, but I’m not getting the results I was hoping for. I’m trying to analyze Product Titles on multiple stores and find products which are on multiple stores. When I break each product title into multiple words, the data is gibberish at best. Is there any way to process this to find keywords instead of just words?

John_Doe · May 21, 2020, 9:34pm

Hi saad - Brian may have a better answer since I’m just a Parabola user, but I wanted to make sure you are aware of the concept of “n-gram” analysis as it can be helpful when it comes to grouping words together for analysis.

I briefly searched and found this API endpoint, which you may be able to use to turn your strings (e.g. “Android Mobile Phone Cover”) into n-grams (e.g. “Android Mobile”, “Mobile Phone”, “Phone Cover”, etc.). You could then use a process similar to the one Brian outlined to find the n-grams with the highest count.
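
To make the n-gram idea concrete, here is a rough Python sketch that builds word-level n-grams locally and counts them. It does not call the API endpoint mentioned above; the `ngrams` helper and the `titles` list are hypothetical names for illustration only.

```python
from collections import Counter

def ngrams(text, n=2):
    """Return the word-level n-grams of a string as space-joined phrases."""
    words = text.split()
    # A sliding window of n consecutive words; empty if the text is too short.
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

print(ngrams("Android Mobile Phone Cover", 2))
# ['Android Mobile', 'Mobile Phone', 'Phone Cover']

# Example rows taken from the original question.
titles = [
    "iPhone XS Cover",
    "Cover for iPhone S",
    "Android Mobile Phone Cover",
]

# Count every bigram across all titles, the same way single words were counted.
bigram_counts = Counter(gram for title in titles for gram in ngrams(title, 2))
```

With only three short titles every bigram appears once, so the counts only become useful on a larger list where the same phrases repeat across stores.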

brian · May 26, 2020, 6:07pm

@John_Doe is right - if you want to distinguish between “filler” words and “keywords”, n-grams are pretty much the route to take. You can use an API, or you can try to define rules in Parabola to omit some of the more common filler words.

It will be a tough process, though, and is certainly prone to inaccuracies and subjective results, so keep that in mind!
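
As a rough sketch of the "omit filler words" rule, the Python below drops a small list of filler words before counting. The `FILLER_WORDS` set is an assumption made up for this example, and choosing it is exactly the subjective part mentioned above; `keyword_counts` and `titles` are likewise hypothetical names.

```python
from collections import Counter

# Hypothetical filler-word list; extend or replace it to suit your own data.
FILLER_WORDS = {"for", "the", "a", "an", "and", "with", "of"}

def keyword_counts(rows, filler=FILLER_WORDS):
    """Count words across rows, skipping any word found in the filler list."""
    counts = Counter()
    for row in rows:
        for word in row.split():
            # Compare in lowercase so "For" and "for" are both filtered out.
            if word.lower() not in filler:
                counts[word] += 1
    return counts

# Example rows taken from the original question.
titles = [
    "iPhone XS Cover",
    "Cover for iPhone S",
    "Android Mobile Phone Cover",
]

counts = keyword_counts(titles)
print(counts.most_common(2))
# [('Cover', 3), ('iPhone', 2)]
```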
