Website and product categorizations
What Is Website Categorization?
Website categorization is a task of classifying website into one of predefined categories, also called taxonomies. Usually this is done by a supervised text classification machine learning model, because in deployment to production one often needs to classify a large number of texts.
Typical website categories
How many categories are in the taxonomy depends on the problem. E.g. in ecommerce setting the top Tier 1 level of categorization usually has 21 categories:
Apparel & Accessories | 226 |
Home & Garden | 115 |
Sporting Goods | 50 |
Health & Beauty | 46 |
Hardware | 37 |
Electronics | 30 |
Animals & Pet Supplies | 25 |
Office Supplies | 19 |
Food, Beverages & Tobacco | 13 |
Toys & Games | 13 |
Business & Industrial | 10 |
Baby & Toddler | 6 |
Luggage & Bags | 6 |
Arts & Entertainment | 4 |
Software | 4 |
Furniture | 4 |
Religious & Ceremonial | 3 |
Mature | 2 |
Cameras & Optics | 2 |
Media | 1 |
Vehicles & Parts | 1 |
Then, on lower Tiers, the google product taxonomy has 190+ categories on Tier 2 and 1000+ categories on Tier 3.
Most usually website categorization is available as API or tool. In this way one can easily integrate it in own products and services.