j-higgins committed on
Commit 144df59 · verified · 1 Parent(s): c20277c

Update README.md

  1. README.md +49 -6
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  title: KeyIntentNER T
3
  emoji: ♾️
4
- colorFrom: red
5
- colorTo: yellow
6
  sdk: docker
7
  pinned: true
8
  license: apache-2.0
@@ -29,11 +29,29 @@ Utilizes GLiNER, an advanced Named Entity Recognition (NER) model, to classify s
29
  Matches keywords to topics from Google's well-known Content and Product taxonomies.
30
 
31
  ## Usage
32
- Enter a list of keywords (one per line, up to 100 MAX) and click the submit button. Keyword processing can take anywhere from 30 seconds up to ~2 minutes due to the extensive analysis performed behind the scenes. Once processing is complete, you can download any of the bar chart images and download a CSV export with insights for all keywords.
33
 
34
  ## Benefits for SEO
35
- Improved Content Strategy
36
- Focus your SEO efforts on creating more relevant and helpful content that addresses the search intent for keywords.
37
 
38
  ### Enhanced Keyword Targeting
39
  Match keywords to Google's well-known categories, ensuring your content is aligned with popular search themes.
@@ -41,10 +59,35 @@ Match keywords to Google's well-known categories, ensuring your content is align
41
  ### Better Understanding of User Intent
42
  Gain insights into what kind of information a person is looking for and how keywords can be interpreted by search engines.
43
 
44
  #### GLiNER Model Citation
45
  - GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer.
46
  - Authors: Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois.
47
  - Year: 2023.
48
  - Link: [arXiv:2311.08526](https://arxiv.org/abs/2311.08526)
49
 
50
- For questions or if you are interested in building custom SEO dash apps, contact me at: jrad.seo@gmail.com
 
1
  ---
2
  title: KeyIntentNER T
3
  emoji: ♾️
4
+ colorFrom: yellow
5
+ colorTo: red
6
  sdk: docker
7
  pinned: true
8
  license: apache-2.0
 
29
  Matches keywords to topics from Google's well-known Content and Product taxonomies.
30
 
31
  ## Usage
32
+ - Enter a list of keywords (one per line, 100 maximum) and click the submit button.
33
+ - Keyword processing can take anywhere from 30 seconds up to ~2 minutes due to the extensive analysis performed behind the scenes.
34
+ - Once processing is complete, you can download any of the bar chart plots and download a CSV export with insights for all keywords.
35
+
36
+ Example keywords:
37
+ ```
38
+ Standing desks vs. regular desks
39
+ car repair service my area
40
+ Buy groceries online with delivery
41
+ average cost of car insurance
42
+ top rated laptops 2024
43
+ Book a flight to Hawaii
44
+ how to fix a leaking faucet
45
+ ```
46
+
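The input rules above (one keyword per line, at most 100) could be enforced with something like the following. This is a minimal sketch, not the app's actual code: `parse_keywords` and `MAX_KEYWORDS` are hypothetical names, and dropping blank lines and case-insensitive duplicates is an assumption the README does not state.

```python
MAX_KEYWORDS = 100  # limit stated in the Usage section


def parse_keywords(raw_text: str, limit: int = MAX_KEYWORDS) -> list[str]:
    """Split pasted text into keywords, one per line.

    Assumption: blank lines and case-insensitive duplicates are dropped.
    """
    seen = set()
    keywords = []
    for line in raw_text.splitlines():
        keyword = line.strip()
        if not keyword or keyword.lower() in seen:
            continue
        seen.add(keyword.lower())
        keywords.append(keyword)
    if len(keywords) > limit:
        raise ValueError(f"Got {len(keywords)} keywords; the limit is {limit}.")
    return keywords
```

Raising an error when the limit is exceeded, rather than silently truncating, keeps the CSV export consistent with what was submitted; truncation would be an equally reasonable choice.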
47
+ #### Example Plot
48
+ ![KeyIntentNER-T_Example Plot](./images/keyintentner_t_example_plot.png)
49
+
50
+ #### Example Table
51
+ ![KeyIntentNER-T_Example Table](./images/keyintentner_t_example_table.png)
52
 
53
  ## Benefits for SEO
54
+ Improve your content strategy by focusing SEO efforts on creating more relevant, helpful content that addresses the search intent behind each keyword.
 
55
 
56
  ### Enhanced Keyword Targeting
57
  Match keywords to Google's well-known categories, ensuring your content is aligned with popular search themes.
 
59
  ### Better Understanding of User Intent
60
  Gain insights into what kind of information a person is looking for and how keywords can be interpreted by search engines.
61
 
62
+ ### Notes on Data:
63
+ - GLiNER Model [gliner_small-v2.1](https://huggingface.co/urchade/gliner_small-v2.1) was trained on the [urchade/pile-mistral-v0.1](https://huggingface.co/datasets/urchade/pile-mistral-v0.1) dataset
64
+ - Keyword Intent is determined by a custom function that looks for the presence of specific terms in keywords and classifies them into one of six predefined intent categories. The function includes popular "keyword modifiers", but a more customized list, or other ML tools for intent assignment, will provide better matches.
65
+ - Keyword Intent Classification is performed using a custom rule-based algorithm. This algorithm scans keywords for specific terms and patterns, categorizing them into one of six predefined intent categories. While the current implementation includes common "keyword modifiers," the accuracy can be significantly improved by:
66
+   - Expanding and customizing the list of intent-indicating terms
67
+   - Implementing more sophisticated machine learning models for intent recognition
68
+ - Topic Modeling uses both [Google Content Categories](https://cloud.google.com/natural-language/docs/categories) & [Google Product Taxonomy](https://www.google.com/basepages/producttype/taxonomy.en-US.txt)
69
+ - GLiNER is able to use a large set of entity types, but for the purposes of this demo it is limited to the following [Google NLP Entity Types](https://cloud.google.com/natural-language/docs/reference/rest/v2/Entity#type):
70
+
71
+ ```
72
+ PERSON : "person"
73
+ LOCATION : "location"
74
+ ORGANIZATION : "organization"
75
+ EVENT : "event"
76
+ WORK_OF_ART : "work_of_art"
77
+ CONSUMER_GOOD : "product" and "service"
78
+ OTHER : "misc"
79
+ PHONE_NUMBER : "phone_number"
80
+ ADDRESS : "address"
81
+ DATE : "date"
82
+ NUMBER : "number"
83
+ PRICE : "price"
84
+ ```
85
+ - Due to the limitations of hosting this demo in a free Space, a smaller sentence transformers model is used, which does not perform as well on some of the Topic Modeling categories. In testing, the [all-roberta-large-v1 model](https://huggingface.co/sentence-transformers/all-roberta-large-v1) performed best on the sample keywords tested.
86
+
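The rule-based intent classification described in the notes above might look roughly like this. It is a sketch under assumptions: the README does not name the six intent categories or list the modifier terms, so the category names, the term lists, and the `classify_intent` helper below are all placeholders chosen for illustration.

```python
import re

# Hypothetical categories and "keyword modifiers"; the real app's lists are not
# published in the README. Rules are checked in order, and the first match wins.
INTENT_RULES = {
    "transactional": ["buy", "book", "order", "coupon"],
    "commercial": ["best", "top rated", "review", "vs"],
    "local": ["near me", "my area", "nearby"],
    "informational": ["how to", "what is", "average cost", "guide"],
    "navigational": ["login", "website", "official site"],
}
DEFAULT_INTENT = "uncategorized"  # sixth category, used when no rule matches

# One case-insensitive, word-bounded pattern per category.
_PATTERNS = {
    intent: re.compile(
        r"\b(?:" + "|".join(re.escape(term) for term in terms) + r")\b",
        re.IGNORECASE,
    )
    for intent, terms in INTENT_RULES.items()
}


def classify_intent(keyword: str) -> str:
    """Return the first intent whose modifier terms appear in the keyword."""
    for intent, pattern in _PATTERNS.items():
        if pattern.search(keyword):
            return intent
    return DEFAULT_INTENT


if __name__ == "__main__":
    for kw in ["Book a flight to Hawaii", "how to fix a leaking faucet"]:
        print(kw, "->", classify_intent(kw))
```

Because the first matching rule wins, the order of `INTENT_RULES` acts as a priority list. Expanding or reordering the term lists is the customization the notes suggest.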
87
  #### GLiNER Model Citation
88
  - GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer.
89
  - Authors: Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois.
90
  - Year: 2023.
91
  - Link: [arXiv:2311.08526](https://arxiv.org/abs/2311.08526)
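As an illustration of the entity-type restriction listed in the notes, the label table can be turned into a lookup that converts GLiNER labels back to Google NLP entity types. `GLiNER.from_pretrained` and `predict_entities` come from the `gliner` package; the helper names, the `threshold=0.5` value, and the way the app actually wires this together are assumptions.

```python
# GLiNER label -> Google NLP entity type, taken from the table in the notes.
LABEL_TO_GOOGLE_TYPE = {
    "person": "PERSON",
    "location": "LOCATION",
    "organization": "ORGANIZATION",
    "event": "EVENT",
    "work_of_art": "WORK_OF_ART",
    "product": "CONSUMER_GOOD",
    "service": "CONSUMER_GOOD",
    "misc": "OTHER",
    "phone_number": "PHONE_NUMBER",
    "address": "ADDRESS",
    "date": "DATE",
    "number": "NUMBER",
    "price": "PRICE",
}


def load_model(name: str = "urchade/gliner_small-v2.1"):
    """Load the GLiNER checkpoint named in the notes (downloads on first use)."""
    from gliner import GLiNER  # imported lazily so the mapping works without it

    return GLiNER.from_pretrained(name)


def extract_entities(keyword: str, model, threshold: float = 0.5):
    """Return (entity text, Google entity type) pairs for one keyword."""
    labels = list(LABEL_TO_GOOGLE_TYPE)
    predictions = model.predict_entities(keyword, labels, threshold=threshold)
    return [(p["text"], LABEL_TO_GOOGLE_TYPE[p["label"]]) for p in predictions]
```

A call such as `extract_entities("Book a flight to Hawaii", load_model())` would run the model on a single keyword. Raising the threshold trades recall for precision.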
92
 
93
+ For questions or if you are interested in building custom SEO dash apps, contact me at: jrad.seo@gmail.com