Query Library

Rank the English Wiktionary words by the length of their articles

Viewing the first 20 of total 22,846 rows

Download Full Results
wordarticle_word_counturl
А12228https://en.wiktionary.org/wiki/%D0%90
set7745https://en.wiktionary.org/wiki/set
sa7425https://en.wiktionary.org/wiki/sa
stand6248https://en.wiktionary.org/wiki/stand
sin6040https://en.wiktionary.org/wiki/sin
da5753https://en.wiktionary.org/wiki/da
sol5578https://en.wiktionary.org/wiki/sol
5093https://en.wiktionary.org/wiki/%E9%A0%AD
el5005https://en.wiktionary.org/wiki/el
post4841https://en.wiktionary.org/wiki/post
he4753https://en.wiktionary.org/wiki/he
office4735https://en.wiktionary.org/wiki/office
fan4658https://en.wiktionary.org/wiki/fan
dot4620https://en.wiktionary.org/wiki/dot
fa4511https://en.wiktionary.org/wiki/fa
fa4511https://en.wiktionary.org/wiki/fa
pike4477https://en.wiktionary.org/wiki/pike
it4427https://en.wiktionary.org/wiki/it
-s4423https://en.wiktionary.org/wiki/-s
I love you4373https://en.wiktionary.org/wiki/I_love_you

Viewing the first 20 of total 22,846 rows

Download Full Results
Query
select
    css_text_first(content, 'h1#firstHeading') as word,
    cardinality(words(css_text_first(content, '#content'))) as article_word_count,
    url
from 
    pages
where 
    url_domain = 'wiktionary.org' 
        and
    url like 'https://en.wiktionary.org/wiki/%'
        and
    url not like 'https://en.wiktionary.org/wiki/%:%'
order by article_word_count desc
Load in editor
Data scanned1.57 GB
Results1.55 MB (22,846 rows)

Turn the web into a database!

Mixnode is a fast, flexible and massively scalable platform to extract and analyze data from the web.

or contact us at hi@mixnode.com