Uncategorized

machine learning – Group similar but exact words and their abbreviation in python


I have a question regarding grouping of similar words and their abbreviation into one group for example I have list of words give below:

  1. artificialintelligence
  2. Artificial Intelligence
  3. AI
  4. Machine Learning
  5. ML
  6. Data Analytics
  7. Data & Analytics

I want to group these words into [Artificial intelligence, Machine Learning, Data Analytics]

I have used difflib.get_close_matches() but that does not give me desired result For example this is how difflib group: Information Technology’: [‘Information Technology’,’Mobile Technology’, ‘newtechnology’]

I have also used fuzz.token_set_ratio() but that also does not provide me the desired result. Neither Levenshtein does.

If there is any Machine learning algorithm or any python library please let me know.



Source link

Leave a Reply

Your email address will not be published. Required fields are marked *