Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Understanding which words dominate your writing reveals patterns you cannot see by reading alone.
This word frequency counter analyzes your text and ranks every word by how many times it appears, giving you a clear picture of repetition, keyword density, and vocabulary diversity.
Writers use word frequency analysis to catch unconscious repetition. If you unknowingly used "however" twelve times in a 2,000-word article, that pattern is invisible during normal reading but immediately obvious in a frequency table. SEO professionals use it to verify that target keywords appear at the right density without keyword stuffing. Researchers use it to analyze text corpora for linguistic patterns. Students use it to ensure their essays are not over-reliant on basic vocabulary.
Paste any text into this tool and get an instant frequency table showing each unique word, its count, and its percentage of the total text. Sort by frequency to find your most-used words, or search for specific terms to check their density. The analysis runs entirely in your browser with no data sent to any server.
The tool processes your text through several steps to produce the frequency table.
Tokenization. The text is split into individual words based on whitespace and punctuation boundaries. Each resulting token is a candidate word. Punctuation attached to words (periods, commas, quotation marks) is stripped so that "word," and "word" count as the same token.
Normalization. All words are converted to lowercase so that "The" and "the" count as the same word. This prevents capitalization at the beginning of sentences from creating false duplicates. If case-sensitive analysis is needed, you can preprocess your text with our Case Converter to standardize before analysis.
Counting. Each unique normalized word is counted. The result is a frequency table mapping every distinct word to the number of times it appears in the text.
Sorting and display. The table is sorted by frequency (most common words first) by default. You can also sort alphabetically or by percentage. Each entry shows the word, its absolute count, and its percentage of the total word count.
For writers. Repetition is one of the most common weaknesses in writing, and it is one of the hardest to detect during self-editing because your brain skips over familiar words. A frequency analysis objectively reveals which words appear disproportionately. If "important" appears 15 times in 1,000 words, that is 1.5% of the text devoted to a single adjective, which is almost certainly too much. Synonyms, restructured sentences, or simply deleting redundant instances will improve the writing.
Beyond individual word repetition, the overall vocabulary diversity of a text affects its quality. Professional writing typically uses a wider range of vocabulary than casual writing. The type-token ratio (unique words divided by total words) is a rough measure of vocabulary diversity. A type-token ratio below 0.4 in a 1,000-word text suggests limited vocabulary. Above 0.6 suggests good diversity.
For SEO. Keyword density, the percentage of the total text that a specific keyword represents, is a factor in search engine optimization. While search engines have moved far beyond simple keyword counting, having your target keyword appear at an appropriate density signals topical relevance. Most SEO practitioners recommend a primary keyword density of 1% to 2%, meaning the keyword appears 10 to 20 times in a 1,000-word article. Exceeding 3% to 4% risks being flagged as keyword stuffing, which can harm rankings.
Secondary keywords and related terms should also appear naturally throughout the text. A word frequency analysis reveals whether your content covers the expected vocabulary for a topic. If you are writing about mortgage calculators, terms like "interest rate," "monthly payment," "principal," and "amortization" should all appear with reasonable frequency.
For researchers. Corpus linguistics relies heavily on word frequency analysis. Researchers analyze large text collections to identify language patterns, compare writing styles, detect authorship, and study how language changes over time. The most frequent words in any English text are consistently function words: "the," "of," "and," "to," "a," and "in." These words typically account for 25% to 30% of all word tokens in standard English prose.
For education. Teachers analyze student writing for vocabulary development. A student whose essays are dominated by basic words like "good," "bad," "thing," and "stuff" needs vocabulary enrichment. Frequency analysis provides objective evidence for these conversations and can track vocabulary growth over time.
The most frequent words in any English text are almost always function words (also called stop words): "the," "is," "at," "which," "and," "on," "a," "an," "in," "to," "of." These words are grammatically necessary but carry little semantic meaning.
In a typical English text, the top 100 most frequent words account for approximately 50% of all word tokens. The top 10 alone often account for 20% to 25%. This means your frequency table will be dominated by these common words unless you filter them out.
For most practical analyses (SEO keyword checking, repetition detection, vocabulary assessment), you will want to mentally skip past the stop words and focus on the content words: nouns, verbs, adjectives, and adverbs that carry the actual meaning of your text. Some frequency analysis tools offer stop word filtering. If this tool does not filter them by default, focus your analysis on words that appear below the standard function words in the frequency table.
Keyword density is calculated as: (number of times keyword appears / total word count) x 100.
If your target keyword "word counter" appears 15 times in a 1,500-word article, the density is (15 / 1,500) x 100 = 1.0%. This is within the generally recommended range of 0.5% to 2.5%.
Too low (below 0.5%). The keyword may not appear enough for search engines to understand the topic of your page. This can result in the page ranking for unintended terms or not ranking at all.
Optimal range (0.5% to 2.5%). The keyword appears naturally and frequently enough to signal topical relevance without feeling forced.
Too high (above 3%). The text may feel repetitive to readers and risks being flagged by search engines as keyword stuffing. Google’s algorithms are sophisticated enough to detect and penalize this practice.
Note that keyword density alone does not determine rankings. Semantic relevance, content quality, user experience, backlinks, and hundreds of other factors all contribute. But keyword density remains a useful baseline metric for ensuring your content is topically focused. Use our Word Counter to verify total word count for accurate density calculations.
Beyond simple frequency counts, several metrics quantify the diversity and richness of a text’s vocabulary.
Type-Token Ratio (TTR). The number of unique words (types) divided by the total number of words (tokens). A 1,000-word text with 400 unique words has a TTR of 0.4. Higher TTR indicates greater vocabulary diversity. TTR naturally decreases with longer texts because function words repeat frequently, so it is most useful for comparing texts of similar length.
Hapax Legomena. Words that appear only once in the text. A high proportion of hapax legomena indicates a diverse vocabulary with many specialized or uncommon terms. In typical English prose, 40% to 60% of unique words are hapax legomena.
Vocabulary Density. The proportion of content words (nouns, verbs, adjectives, adverbs) to total words. A higher vocabulary density indicates more information-rich text. Academic and technical writing typically has higher vocabulary density than casual or conversational writing.
These metrics are most useful in comparative contexts: comparing two drafts of the same article, comparing student writing over time, or comparing your content against competitors for the same search query.
Run the analysis after your first draft. Do not optimize while writing. Complete your draft, then use the frequency counter to identify issues. This separates the creative process from the editing process.
Check your target keywords. After analysis, search the frequency table for your primary and secondary SEO keywords. Verify they appear at appropriate densities and are not over- or under-represented.
Identify and replace repetitive words. If a non-function word appears in the top 20 most frequent words, it may be overused. Consider synonyms or restructuring to reduce repetition.
Compare against competitors. Analyze top-ranking content for your target keyword. If competitor articles consistently use certain terms that your content lacks, those terms may be important for topical coverage.
Track vocabulary growth. For students and developing writers, running frequency analysis on successive writing samples tracks vocabulary expansion objectively. Our Character Counter and Words to Pages Converter complement this analysis with length metrics.
Word frequency is the count of how many times each word appears in a text. A word with a frequency of 5 appears five times in the text. Frequency can also be expressed as a percentage of total words.
Keyword density is the percentage of a text that a specific keyword represents, calculated as (keyword count / total words) x 100. For SEO, a density of 0.5% to 2.5% for your primary keyword is generally recommended.
The most frequent English words are function words: "the," "of," "and," "to," "a," "in," "is," "that," "it," and "for." These words appear frequently in virtually all English texts and carry grammatical rather than semantic meaning.
Frequency analysis reveals unconscious repetition that is hard to detect while reading. If a word appears disproportionately often, you can replace instances with synonyms or restructure sentences to improve variety and readability.
This tool counts individual words (unigrams). Phrase frequency analysis (counting two-word or three-word sequences, called bigrams and trigrams) is a more advanced technique used in computational linguistics.
In a 1,000-word text, a type-token ratio above 0.5 generally indicates good vocabulary diversity. Below 0.4 suggests limited vocabulary. TTR decreases naturally in longer texts, so compare only texts of similar length.
Yes. Standalone numbers like "42" or "2024" are counted as individual words. Numbers embedded within words follow the same tokenization rules as regular text.
Focus your analysis on words that appear below the standard function words in the frequency table. The top 10 to 20 entries will typically be common words like "the," "and," "is," and "to." Look past these to find the content words that reveal patterns in your writing.
Data accurate as of: March 2026