Sketch Engine
@SketchEngine
Sketch Engine is a corpus query system with text analysis tools for text corpora in 100+ languages.
ID:841197134
https://sketchengine.eu 23-09-2012 07:14:34
951 Tweets
3.7K Followers
314 Following
Still in Spain!🇪🇸 Meet our expert in person at #GALA2024 in Valencia! Drop by our booth and chat with him about how Sketch Engine can level up your translation. And yes, Ondřej's there for real - he snapped this pic himself! #TranslationTech #Localization #languageindustry
Looking for an Arabic corpus? Try our Arabic Corpus 2024 with a free 30-day trial and it might become your top pick for comprehensive text analysis, linguistic research, and #NLP. sketchengine.eu/artenten-arabi… #corpuslinguistics #TextAnalysis #computationallinguistics
Sketch Engine speaks German again! We've completed the German translation of the interface. Huge thanks to German students Christopher, Mona, and Jonas from the Translatology program at UNIVERSITÄT LEIPZIG for their diligent work, and to our colleague František.
#corpuslinguistics
Large, larger, the largest! We have created a colossal corpus. This English monitor corpus automatically grows by 70 million words every week. Thanks to publication dates, users can study #WordUsage changes, trends, and neologisms. sketchengine.eu/english-trends…
#corpuslinguistics
Don't miss Lexicom 2024 in Málaga, Spain. Immerse yourself in #lexicography and #corpuslinguistics. Over five days of hands-on experience and expert-led sessions, discover new techniques, including the use of #AI for building dictionaries. lexicom.courses/lexicom-2024-m…
Study changes in Czech with our Czech monitor corpus growing by 2 million words daily. This 1.7-billion-word corpus enables you to use Trends, the #DIACHRONIC analysis tool, and study #WordUsage changes and neologisms.👉 sketchengine.eu/czech-trends-c…
#corpuslinguistics
Sketch Engine offers access to hundreds of parallel corpora, including a selection of Arabic texts. Explore United Nations documents, OpenSubtitles texts, the OPUS collection of multilingual data, and more. #arabiclanguage #appliedlinguistics #corpuslinguistics
Dr Mary Jacob L&T (@maryjacob.bsky.social): The most underappreciated tools for analysing textual data are corpus analysis tools such as Sketch Engine - they will show you many useful things about your data before you start coding. They are useful for reviewing LLM outputs, as well. Otherwise go old-school, i.e. Python or R.
How does human-inspired automatic term extraction work? Read an article published in the journal of the German Terminology Association (DTT e. V.).
👉 dttev.org/images/edition…
#terminology #Terminologie #termextraction
We'd like to draw your attention to Lexicom 2024 in Málaga, Spain. The course covers #lexicography, #corpuslinguistics, and #NLP, including #LLM models such as ChatGPT for dictionary tasks. lexicom.courses/lexicom-2024-m…
We also extended our list of corpora in 2023, so we're now approaching 800 corpora in total. The largest corpus: English Web 2021 with 52 billion words. New languages: Assamese, Bashkir, and Northern Kurdish.
sketchengine.eu/corpora-and-la…
#corpuslinguistics #LanguageData
We won't leave our Slovak brothers without a new corpus either. After all, almost half of the team behind Sketch Engine is from Slovakia 🙂🇸🇰 Here is the new Slovak web corpus 2023, again enriched with genre annotation and topic classification.
sketchengine.eu/sktenten-slova…
#corpuslinguistics #digitalhumanities