Primož Jakopin
Primož Jakopin | |
---|---|
Jakopin in Ljubljana, 2015 | |
Born | 30 June 1949, Ljubljana |
Nationality | Slovenian |
Fields | Computational linguistics |
Institutions | University of Ljubljana, University of Nova Gorica |
Alma mater | University of Ljubljana, University of Zagreb |
Primož Jakopin (born 30 June 1949 in Ljubljana) is a Slovenian computer scientist, best known for his work in the field of language technology.
In 1972 he completed a degree in technical mathematics at the University of Ljubljana with the diploma thesis Numerično računanje singularnih integralov (Numerical Computation of Singular Integrals).[1] In 1981 he obtained his master's degree in information sciences at the University of Zagreb with the thesis Entropija imena i prezimena u Sloveniji (On entropy of first names and last names in Slovenia).[2] In 1999 he earned his Ph.D., again at the University of Ljubljana, with the thesis Zgornja meja entropije pri leposlovnih besedilih v slovenskem jeziku (Upper Bound of Entropy in Slovenian Literary Texts).[3]
He was a senior lecturer at the Department of Comparative and General Linguistics, Faculty of Arts, University of Ljubljana, where he taught language technologies with an emphasis on lemmatisation.[4] From 2001 to 2012 he was the Head of the Corpus Laboratory at the Fran Ramovš Institute of Slovene Language (within the Scientific Research Centre of the Slovenian Academy of Sciences and Arts).[5] He participated in a number of European projects on language resources.[6]
His major pieces of software were IBIS (Digital DEC 10, 1981), INES (Sinclair ZX Spectrum, 1985),[7] STEVE (Atari ST, 1987–1992),[8] EVA (for DOS from 1992, and for Windows 9X/2000/XP, 1996–2005) and NEVA (a Windows server search engine, 1999–2005). From 1992 to 1994 he supervised the transfer of the Standard Slovenian Dictionary (SSKJ) from the printed to an electronic version (EVA OCR, DOS version). In 1997 he wrote the first part-of-speech tagger for Slovene texts. In 1999 he started an Internet text corpus with a concordance service and linked frequency dictionaries of wordforms and reversed wordforms. It is now available as Nova beseda (New word).
His father was the Slovene linguist Franc Jakopin and his mother was the poet and translator Gitica Jakopin. His brother Japec Jakopin is a yacht concept designer, and his brother Jernej Jakopin is a naval architect.
Publications
- CORTES - a text corpus of Slovene. In publication: Digital resources for the humanities: Conference abstracts (University of Sheffield, 10–13 September 2000). - Sheffield: University of Sheffield, 2000. - p. 70-72. COBISS 16309805
- EVA - an internet tool fr[!] textual and lexical resources. In publication: Linguistics and language studies / 32nd Annual Meeting, Ljubljana, 8–11 July 1999. - Ljubljana : University, Faculty of Arts: Societas Linguistica Europaea, 1999. - p. 98. COBISS 19620397
- The feasibility of a complete text corpus. LREC 2002: proceedings. COBISS 21865773
- On text corpora, word lengths, and word frequencies in Slovene. In publication: Contributions to the science of text and language / edited by Peter Grzybek. - Dordrecht: Springer, 2006. (Text, speech and language technology ; vol. 31). - ISBN 1-4020-4067-9. - p. 171-185. COBISS 24779309
- Query-driven dictionary enhancement. Co-author: Birte Lönneker. In publication: Proceedings of the Eleventh EURALEX International Congress, EURALEX 2004, Lorient, France, July 6–10, 2004 / Geoffrey Williams and Sandra Vessier (eds.). - Lorient : Université de Bretagne-Sud, cop. 2004-. - p. 273-284. COBISS 22533677
- Slovene texts on the internet. In publication: Zapiski: Chronicle of the American Slovene Congress. Issue 7 (May 2000), p. 4-7. COBISS 15997485
- Words and nonwords as basic units of a newspaper text corpus. In publication: COMPLEX 2001 / 6th Conference on Computational Lexicography and Corpus Research "Computational Lexicography and New EU Languages", Mason Hall, Birmingham, 28 June-1 July 2001. - Birmingham: Centre for Corpus Linguistics, Department of English, University of Birmingham, 2001. - p. 49-65. COBISS 16206690
- Entropija v slovenskih leposlovnih besedilih (Upper Bound of Entropy in Slovene Literary Texts), Založba ZRC, Ljubljana 2002. COBISS 121042688
- O oblikoslovnem označevanju slovenskega besedila (Morphological tagging of Slovene texts) (co-author A. Bizjak), Slavistična revija 1997.
- Odzadnji slovar slovenskega jezika (Inverse Dictionary of Slovene language) (co-author M. Hajnšek-Holz), Ljubljana 1996. COBISS 62839552
References
- [1] Jakopin, Primož (1972). Numerično računanje singularnih integralov: diplomsko delo [Numerical Computation of Singular Integrals] (Diploma thesis). University of Ljubljana. p. 68. Retrieved 5 July 2016.
- [2] Jakopin, Primož (1981). Entropija imena i prezimena u Sloveniji: magistarski rad [On entropy of first names and last names in Slovenia] (M.Sc. thesis). University of Zagreb. p. 123. Retrieved 5 July 2016.
- [3] Jakopin, Primož (1999). Zgornja meja entropije pri leposlovnih besedilih v slovenskem jeziku [Upper Bound of Entropy in Slovenian Literary Texts] (Ph.D.). University of Ljubljana. p. 205. Retrieved 5 July 2016.
- [4] "Primož Jakopin". Department of Comparative and General Linguistics, Faculty of Arts, University of Ljubljana. Ljubljana, Slovenia. Retrieved 27 June 2016.
- [5] "Assistant Prof. Primož Jakopin, PhD, Senior Research Fellow". Fran Ramovš Institute of the Slovenian Language / Members. Ljubljana, Slovenia. Retrieved 27 June 2016.
- [6] Heike Rettig, ed. (1995). "Language Resources for Language Technology". Proceedings of the TELRI (Trans-European Language Resources Infrastructure), 1st European Seminar, September 15–16, 1995. Tihany, Hungary. Retrieved 24 June 2016.
- [7] Jakopin, Primož (1985). "INES: Urejevalnik Podatkov, Slik in Besedil" [INES: Data, Picture and Text Editor]. Sinclair Infoseek. Retrieved 27 June 2016.
- [8] Jakopin, Primož; Vučkovič, Andrej (1989). "STEVE reference manual: text, graphics, data base, DTP and CAI on Atari ST". OCLC WorldCat. Retrieved 27 June 2016.
External links
- Personal webpage (with selected publications)
- His expert profile at ELSNET's Directory of Language and Speech Technology Experts
- Official webpage at Department of Comparative and General Linguistics, Faculty of Arts, University of Ljubljana