12th April 2017


Publications in Bulgarian of the BulTreeBank Project


Petya Osenova, Kiril Simov. Formal Grammar of the Bulgarian Language. Institute for Parallel Processing, Bulgarian Academy of Sciences (IPP, BAS). Sofia, 18.12.2007. (In Bulgarian)

Kiril Simov, Petya Osenova. A Corpus of Syntactic Descriptions of the Bulgarian Language: BulTreeBank. Seminar, Sofia University “St. Kliment Ohridski”. Sofia, 28.01.2005. (In Bulgarian)

Petya Osenova. Noun Phrases of the NN Type in Bulgarian. Sixth National Slavic Studies Readings: Slavic Studies at the Beginning of the 21st Century. Traditions and Expectations. Sofia, 26-27.04.2002. (In Bulgarian)

Petya Osenova, Kiril Simov. A Formal Description of Bulgarian Vocative Forms within Head-driven Phrase Structure Grammar (HPSG). Sixth National Slavic Studies Readings: Slavic Studies at the Beginning of the 21st Century. Traditions and Expectations. Sofia, 26-27.04.2002. (In Bulgarian)

Kiril Simov, Sia Kolkovska. An Interpretation of da-Constructions in Head-driven Phrase Structure Grammar. Sixth National Slavic Studies Readings: Slavic Studies at the Beginning of the 21st Century. Traditions and Expectations. Sofia, 26-27.04.2002. (In Bulgarian)




Publications in English of the BulTreeBank Project


Kiril Simov, Petya Osenova. Towards Minimal Recursion Semantics over Bulgarian Dependency Parsing. In the proceedings of the RANLP 2011 Conference. 12th-14th September, Hissar, Bulgaria.

Kiril Simov, Petya Osenova, Laska Laskova, Aleksandar Savkov, Stanislava Kancheva. Bulgarian-English Parallel Treebank: Word and Semantic Level Alignment. In the proceedings of the Second Workshop on Annotation and Exploitation of Parallel Corpora. 15th September, Hissar, Bulgaria.

Petya Osenova, Kiril Simov. Syntactic-Semantic Treebank for Domain Ontology Creation. In: Cognitive Studies/Études Cognitives, 11. SOW Publishing House, Warsaw, Poland, 2011. pp 213-225.

Voula Giouli, Kiril Simov, Petya Osenova. A Parallel Greek-Bulgarian Corpus: A Digital Resource of the Shared Cultural Heritage. In: Caroline Sporleder, Antal van den Bosch, Kalliopi Zervanou (eds) Language Technology for Cultural Heritage (Selected Papers from the LaTeCH Workshop Series), edited volumes in Theory and Applications of NLP. Springer. 2011. pp 99-112.

Paola Monachesi, Thomas Markus, Eline Westerhout, Petya Osenova, and Kiril Simov. Supporting formal and informal learning through domain ontologies. e-Education, e-Business, e-Management and e-Learning (IC4E) 2011.


Petya Osenova, Laska Laskova and Kiril Simov. 2010. Exploring Co-Reference Chains for Concept Annotation of Domain Texts. LREC 2010. Malta. pp. 172-176.

Kiril Simov and Petya Osenova. 2010. Constructing of an Ontology-based Lexicon for Bulgarian. LREC 2010. Malta. pp. 3840-3844.

Kiril Simov and Petya Osenova. 2010. Semantic Annotation for Semi-Automatic Positioning of the Learner. Supporting eLearning with Language Resources and Semantic Data Workshop 2010. LREC 2010. Malta. pp. 46-50.

Petya Osenova and Kiril Simov. 2010. Using the linguistic knowledge in BulTreeBank for the selection of the correct parses. The Ninth International Workshop on Treebanks and Linguistic Theories. Tartu, Estonia. pp. 163-174. ISSN: 1736-6305

Mariana Damova, Svetoslav Petrov and Kiril Simov. Data Driven and Upper Level Ontology. Artificial Intelligence: Methodology, Systems, and Applications. LNCS, 2010, Volume 6304/2010, 269-270.

Mariana Damova, Atanas Kiryakov, Kiril Simov, Svetoslav Petrov. Mapping the Central LOD Ontologies to PROTON Upper-Level Ontology. OM-2010: The Fifth International Workshop on Ontology Matching, ISWC-2010. Shanghai, China. pp 61-72.


Kiril Simov. Ontology-Based Lexicon of Bulgarian. Journal for Language Technology and Computational Linguistics. 2009. Volume 24, Number 2, pp. 40-55. ISSN 0175-1336.

Prokić, E., J. Nerbonne, V. Zhobov, P. Osenova, K. Simov, T. Zastrow, E. Hinrichs. 2009. The Computational Analysis of Bulgarian Dialect Pronunciation. In: Serdica Journal of Computing v. 3, n3, pp. 269–298. ISSN 1312-6555

Chanev, Atanas, Kiril Simov, Petya Osenova and Svetoslav Marinov. The BulTreeBank: Parsing and Conversion. In: Nicolov, N., G. Angelova, and R. Mitkov (Eds.). Recent Advances in Natural Language Processing V: Selected papers from RANLP 2007. Vol. 309 in the series “Current Issues in Linguistic Theory”, John Benjamins Publ. Co., ISSN 0304-0763, Amsterdam, 2009, pp. 321-330.

Kiril Simov. A Knowledge-rich Lexicon for Bulgarian. In: Proceedings of the MONDILEX Third Open Workshop: Metalanguage and Encoding Scheme Design for Digital Lexicography, Bratislava, Slovakia, 15–16 April, 2009. ISBN 978-80-7399-745-8. pp 168-176

Kiril Simov and Petya Osenova. Syntactic-Semantic Treebank for Domain Ontology Creation. In: Proceedings of the MONDILEX Fourth Open Workshop: Representing Semantics in Digital Lexicography. Warsaw, Poland, 29 June – 1 July, 2009. ISBN 978-83-89191-87-8. pp 115-122

Voula Giouli, Nikos Glaros, Kiril Simov and Petya Osenova. A Web-Enabled and Speech-Enhanced Parallel Corpus of Greek-Bulgarian Cultural Texts. In: Proceedings of Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education (LaTeCH – SHELT&R 2009). ISBN 1-932432-21-3. pp 35-41

Monachesi, P., Markus, T., Osenova, P., Posea, V., Simov, K. and Trausan-Matu, S. (2009). Supporting knowledge discovery in an eLearning environment having social components. In: Internet conference: CISSE 2009 Conference.

Georgi Georgiev, Preslav Nakov, Kuzman Ganchev, Petya Osenova, and Kiril Simov. Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields. In: Angelova, G., K. Bontcheva, R. Mitkov, N. Nicolov, and N. Nikolov (Eds.) Proceedings of the International Conference RANLP-09 “Recent Advances in Natural Language Processing”, Borovets, Bulgaria, 14-16 September 2009, Published by Incoma Ltd., Shoumen, ISSN 1313-8502, 2009, pp. 113-117.

Georgi Georgiev, Preslav Nakov, Petya Osenova and Kiril Simov. Cross-lingual Adaptation as a Baseline: Adapting Maximum Entropy Models to Bulgarian. Proceedings of the Workshop on Adaptation of Language Resources and Technology to New Domains, in conjunction with RANLP’09, Borovetz, Bulgaria, September 17, 2009. ISBN 978-954-452-009-0. pp 35-38.


Kiril Simov and Petya Osenova. A Treatment of Coordination in the Bulgarian HPSG-based Treebank. 2008. In: G. Zybatow, L. Szucsich, U. Junghanns, R. Meyer (eds.), Proc. of Formal Description of Slavic Languages 5, PETER LANG Internationaler Verlag der Wissenschaften, series Linguistik International, 2008, ISSN 1436-6150, ISBN 978-3-631-55160-8, pp 68-77

Lothar Lemnitzer, Kiril Simov, Petya Osenova, Eelco Mossel and Paola Monachesi. 2008. Using a domain-ontology and semantic search in an eLearning environment. In: Innovative Techniques in Instruction Technology, E-learning, E-assessment, and Education. Springer Netherlands. pp 279-284. ISBN 978-1-4020-8738-7.

Kiril Simov, Petya Osenova and Svetlomira Vidinska. 2008. Selection of vocabulary and creation of spelling dictionary of Bulgarian on the basis of the corpus BulTreeBank. In: Studies in phraseology, lexicography. pp. 407-414. Bulgarian Academy of Sciences Publishing House 2008. ISBN 978-954-322-166-0 (in Bulgarian)

Elena Todorova and Kiril Simov. 2008. Frequency lexicon of the poetry of Yavorov. In: Studies in phraseology, lexicography. pp. 402-406. Bulgarian Academy of Sciences Publishing House 2008. ISBN 978-954-322-166-0 (in Bulgarian)

Kiril Simov and Petya Osenova. 2008. Language Resources and Tools for Ontology-Based Semantic Annotation. In: Al. Oltramari, L. Prévot, Chu-Ren Huang, P. Buitelaar, P. Vossen (eds.), OntoLex 2008 Workshop at LREC 2008, pp. 9-13. Published by the European Language Resource Association ELRA.

Petya Osenova, Kiril Simov, and Eelco Mossel. 2008. Language Resources for Semantic Document Annotation and Crosslingual Retrieval. In: Proc. of LREC 2008, Published by the European Language Resource Association ELRA.

Paola Monachesi, Kiril Simov, Eelco Mossel, Petya Osenova, Lothar Lemnitzer. 2008. What can ontologies do for eLearning? In: Proceedings of The Third International Conference on Interactive Mobile and Computer Aided Learning (IMCL 2008).

Kiril Simov and Petya Osenova. 2008. An Architecture for Semantic Annotation and Retrieval of Multimedia Documents. Invited talk. In: P. Stockinger, D. Dochev (eds.), Proc. of the Second LOGOS Open Workshop. Cross-Media and Personalized Learning Applications with Intelligent Content (LAIC 2008). 3 September 2008, Varna. pp. 27-26. IIT-BAS, ISBN 978-954-91700-3-0

Kiril Simov and Petya Osenova. 2008. Bulgarian Language Resources for Information Technology. In: Leonid Iomdin, Ludmila Dimitrova (Eds.), Proceedings of the MONDILEX Open Workshop “Lexicographic Tools and Techniques”, Moscow, 3–4 October 2008. Pages 60-67. ISBN 978-5-9900813-6-9

Kiril Simov and Petya Osenova. 2008. A seed lexicon for Bulgarian. In: Abstract Proc. of Lexical-Semantic and Ontological Resources Maintenance, Representation, and Standards Workshop of the GLDV, Working Group on Lexicography at KONVENS 2008. Berlin, Germany. Page 4. Invited for a journal publication in GLDV-Forum, the journal of the German Society of Computational Linguistics


Petya Osenova and Kiril Simov. 2007. Formal Grammar of Bulgarian. IPP, BAS. 128 pages. ISBN 78-954-92148-2-6 (In Bulgarian).

Kiril Simov and Petya Osenova 2007: Applying a normalized compression metric to the measurement of dialect distance. In: Serdica Journal of Computing 1, pp. 73-86.

Cristina Vertan, Paola Monachesi, Kiril Simov, Petya Osenova, Lothar Lemnitzer, Alex Killing and Diane Evans. 2007. Crosslingual retrieval in an eLearning environment. Published in the proceedings of The 10th Congress of the Italian Association for Artificial Intelligence (AIIA 2007), LNCS. Volume 4733. pp. 839-847

Lothar Lemnitzer, Cristina Vertan, Alex Killing, Kiril Simov, Diane Evans, Dan Cristea, Paola Monachesi. 2007. Improving the search for learning objects with keywords and ontologies. Appeared in: Duval, Erik; Klamma, Ralf; Wolpers, Martin (Eds.) Creating New Learning Experiences on a Global Scale. Second European Conference on Technology Enhanced Learning, Lecture Notes in Computer Science, Vol. 4753, pp. 202-216. The paper won the best paper award at the EC-TEL conference.

Atanas Chanev, Kiril Simov, Petya Osenova and Svetoslav Marinov 2007: The BulTreeBank: Parsing and Conversion. In: Galia Angelova et al. (eds.) Proceedings from RANLP 2007, pp. 114-120

Adam Przepiórkowski, Lukasz Degórski, Miroslav Spousta, Kiril Simov, Petya Osenova, Lothar Lemnitzer, Vladislav Kubon, Beata Wójtowicz 2007: Towards the automatic extraction of definitions in Slavic. In the proceedings of the BSNLP workshop at ACL 2007, pp. 43-50

Petya Osenova and Kiril Simov. 2007. An Infrastructure for Storing and Processing Dialect Data. In: Bulgarian Islands on Balkans, Figura publ., 2007, pp. 256-263.

Kiril Simov, Petya Osenova, Alexander Simov, Anelia Tincheva, Borislav Kirilov 2007: A System for A Semi-Automatic Ontology Annotation. In: Proceedings from the International Workshop on Computer-Aided Language Processing (CALP), K. Orasan and S. Kubler, eds., RANLP 2007, 30 Sept. 2007, pp. 45-52.

Kiril Simov and Petya Osenova. Bulgarian Language Resources for Ontology-Based Semantic Search. In Proceedings of Workshop on a Common Natural Language Processing Paradigm For Balkan Languages, In conjunction with RANLP-2007 conference. 26.09.2007. Borovets, Bulgaria.

Kiril Simov and Petya Osenova. 2007. Applying Ontology-Based Lexicons to the Semantic Annotation of Learning Objects. In: Proceedings from the Workshop on NLP and Knowledge Representation for eLearning Environments, RANLP-2007, pp. 49-55.


Paola Monachesi, Lothar Lemnitzer and Kiril Simov. 2006. Language Technology for eLearning. In Innovative Approaches for Learning and Knowledge Sharing. Springer LNCS 4227. ISBN 978-3-540-45777-0

Kiril Simov and Petya Osenova. 2006. Semantic Annotation in Bulgarian Treebank. In volume: Readings in Multilinguality. Selected papers for young researchers. IPP, BAS, Sofia, Bulgaria. pp. 109-116.

Kiril Simov, Petya Osenova. 2006. BulQA: Bulgarian-Bulgarian Question Answering at CLEF 2005. Working Notes for the CLEF 2005 Workshop, 21-23 September, Vienna, Austria. Lecture Notes in Computer Science 4022, Springer Verlag. pp 517-526.

Paola Monachesi, Cristea, D., Evans, D., Killing, A., Lemnitzer, L., Simov, K. and Vertan, C. 2006. Integrating language technology and semantic web techniques in elearning. Proceedings of ICL 2006

Petya Osenova and Kiril Simov. 2006. Special Linguistic Phenomena in the Bulgarian HPSG-based Treebank (BulTreeBank). In abstract proceedings of the 13th International Conference on Head-Driven Phrase Structure Grammar, Varna, Bulgaria. pp. 176-182.

Kiril Simov, Petya Osenova. 2006. Ontology-based Lexicon and Semantic Annotation. In: Proceedings of International Workshop “Ontology Based Modelling in The Humanities”. Hamburg, Germany. pp. 66-71.

Kiril Simov, Petya Osenova. 2006. Shallow Semantic Annotation of Bulgarian. In: Proceedings of the Fifth International Conference on Language Resources and Evaluation. Genoa, Italy. pp. 2347-2352.

Chanev, A., Simov, K., Osenova, P., Marinov, S. 2006. Dependency Conversion and Parsing of the BulTreeBank. In Proceedings of the LREC workshop Merging and Layering Linguistic Information, Genoa, Italy, 2006. pp. 16-23.


Hristo Tanev, Milen Kouylekov, Bernardo Magnini, Matteo Negri, Kiril Simov. 2005. Exploiting Linguistic Indices and Syntactic Structures for Multilingual Question Answering: ITC-irst at CLEF 2005. Working Notes for the CLEF 2005 Workshop, 21-23 September, Vienna, Austria. Lecture Notes in Computer Science 4022, Springer Verlag. pp 390-399.

Ludmila Dimitrova, Radoslav Pavlov, Kiril Simov, Lydia Sinapova. Bulgarian MULTEXT-East Corpus – Structure and Content. Journal of Cybernetics and Information Technologies. Vol. 5, No 1. Publishing House of Bulgarian Academy of Sciences. 2005. pages 67-73.

Kiril Simov and Petya Osenova. Extending the Annotation of BulTreeBank: Phase 2. The Fourth Workshop on Treebanks and Linguistic Theories (TLT 2005) Barcelona, 9-10 December 2005. pp 173-184

Tanev, Hristo, Milen Kouylekov, Bernardo Magnini, Matteo Negri, and Kiril Simov. 2005. Exploiting Linguistic Indices and Syntactic Structures for Multilingual Question Answering: ITC-irst at CLEF 2005. In Proceedings of the Cross-Language Evaluation Forum 2005 workshop (CLEF), Vienna, Austria, September.

Petya Osenova, Kiril Simov. Infrastructure for Bulgarian Question Answering. Implication for the Language Resources and Tools. Piperidis and Paskaleva (eds). Proc. Workshop on Language and Speech Infrastructure for Information Access in the Balkan Countries. Borovets, Bulgaria. 2005. pp 47-52

The Proceedings of the Exploring Syntactically Annotated Corpora Workshop.


Kiril Simov and Petya Osenova. A Treebank-Driven Approach to Semantic Lexicons Creation. In: Proceedings of TLT04, Tuebingen, Germany. 2004.

The Proceedings of the ESSLLI 2004 Workshop on Combining Shallow and Deep Processing for NLP

Kiril Simov, Alexander Simov, Petya Osenova. An XML Architecture for Shallow and Deep Processing. In: The Proceedings of the ESSLLI 2004 Workshop on Combining Shallow and Deep Processing for NLP. 2004. pages 51-60.

Kiril Simov and Petya Osenova. A Hybrid Strategy for Regular Grammar Parsing. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 431-434.

Kiril Simov, Petya Osenova, Sia Kolkovska, Elisaveta Balabanova, Dimitar Doikoff. A Language Resources Infrastructure for Bulgarian. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 1685-1688.

Božo Bekavac, Petya Osenova, Kiril Simov, Marko Tadić. Making Monolingual Corpora Comparable: a Case Study of Bulgarian and Croatian. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 1187-1190.

Kiril Simov, Alexander Simov, Hristo Ganev, Krasimira Ivanova, Ilko Grigorov. The CLaRK System: XML-based Corpora Development System for Rapid Prototyping. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 235-238.

Tylman Ule, Kiril Simov. Unexpected Productions May Well be Errors. In: Proceedings of LREC 2004, Lisbon, Portugal. 2004. pages 1795-1798.

Kiril Simov, Petya Osenova, Alexander Simov, Krasimira Ivanova, Ilko Grigorov, Hristo Ganev. Creation of a Tagged Corpus for Less-Processed Languages with CLaRK System. In: Proceedings of SALTMIL Workshop at LREC 2004: First Steps in Language Documentation for Minority Languages, Lisbon, Portugal. 2004. pages 80-83.


Kiril Simov, Petya Osenova, Sia Kolkovska, Elisaveta Balabanova, Dimitar Doikoff. Language resources for the creation of a Bulgarian Treebank. In: Workshop on Balkan Language Resources and Tools, 21 November 2003, Thessaloniki, Greece (satellite event to the Balkan Conference on Informatics – BCI 2003). 2003.

Kiril Simov, Alexander Simov, Krassimira Ivanova, Ilko Grigorov, Hristo Ganev. The CLARK System Tools XML based Corpora development. In: Workshop on Balkan Language Resources and Tools, 21 November 2003, Thessaloniki, Greece (satellite event to the Balkan Conference on Informatics – BCI 2003). 2003.

Petya Osenova and Kiril Simov. The Bulgarian HPSG Treebank: Specialization of the Annotation Scheme. In: Proc. of The Second Workshop on Treebanks and Linguistic Theories (TLT2003), 14-15 November 2003, Växjö, Sweden.

Kiril Simov. HPSG-Based Annotation Scheme for Corpora Development and Parsing Evaluation. In: Proc. of the RANLP 2003 Conference, Borovets, Bulgaria, 10-12 September 2003. pages 432-439.

Kiril Simov and Petya Osenova. Practical Annotation Scheme for an HPSG Treebank of Bulgarian. In: Proc. of the 4th International Workshop on Linguistically Interpreted Corpora (LINC-2003), Budapest, Hungary. 2003.

Kiril Simov, Alexander Simov, Milen Kouylekov, Krasimira Ivanova, Ilko Grigorov, Hristo Ganev. Development of Corpora within the CLaRK System: The BulTreeBank Project Experience. In: Proc. of the Demo Sessions of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL’03), Budapest, Hungary. 2003.

Tomaz Erjavec, Cvetana Krstev, Kiril Simov, Marko Tadic, Dusko Vitas. The MULTEXT-East Morphosyntactic Specifications for Slavic Languages. In: Proc. of the Workshop on Morphological Processing of Slavic Languages at EACL-2003, Budapest, Hungary. 2003.

The Proceedings of the Shallow Processing of Large Corpora (SProLaC 2003) Workshop

Petya Osenova and Kiril Simov. Between Chunk Ideology and Full Parsing Needs. In: Proceedings of the Shallow Processing of Large Corpora (SProLaC 2003) Workshop, Lancaster, UK. pages: 78-87.

Kiril Simov, Alexander Simov, Milen Kouylekov. Constraints for Corpora Development and Validation. In: Proc. of the Corpus Linguistics 2003 Conference, pages: 698-705.


Kiril Simov, Petya Osenova, Sia Kolkovska, Elisaveta Balabanova, Dimitar Doikoff, Krassimira Ivanova, Alexander Simov, Milen Kouylekov. Building a Linguistically Interpreted Corpus of Bulgarian: the BulTreeBank. In: Proceedings of LREC 2002, Canary Islands, Spain. 2002. pages 1729-1736.

Kiril Simov, Milen Kouylekov, Alexander Simov. Incremental Specialization of an HPSG-Based Annotation Scheme. In: Proceedings of LREC 2002 Workshop on “Linguistic Knowledge Acquisition and Representation: Bootstrapping Annotated Language Data”, Canary Islands, Spain. 2002. pages 16-23.

Kiril Simov. Grammar Extraction and Refinement from an HPSG Corpus. In: Proc. of the ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics, Trento, Italy. August 5-16, 2002. pages 38-55.

Petya Osenova and Kiril Simov. Learning a token classification from a large corpus. (A case study in abbreviations). In: Proc. of the ESSLLI Workshop on Machine Learning Approaches in Computational Linguistics, Trento, Italy. August 5-16, 2002. pages 16-28.

Petya Osenova and Kiril Simov. Bulgarian Vocative within HPSG framework. In: Proc. of the 9th International Conference on Head-Driven Phrase Structure Grammar (HPSG), Kyung Hee University, Seoul, South Korea. August 8-9, 2002. pages 94-100.

Kiril Simov, Milen Kouylekov, Alexander Simov. Cascaded Regular Grammars over XML Documents. In: Proc. of the 2nd Workshop on NLP and XML (NLPXML-2002), Taipei, Taiwan. September 1, 2002. pages 51-58.

Elisaveta Balabanova and Krassimira Ivanova. Creating a machine-readable version of Bulgarian valence dictionary: (A case study of CLaRK system application). In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 1-12.

Krassimira Ivanova and Dimitar Doikoff. Cascaded Regular Grammars and Constraints over Morphologically Annotated Data for Ambiguity Resolution. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 96-113.

Petya Osenova. Bulgarian Nominal Chunks and Mapping Strategies for Deeper Syntactic Analyses. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 150-166.

Petya Osenova and Sia Kolkovska. Combining the named-entity recognition task and NP chunking strategy for robust pre-processing. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 167-182.

Kiril Simov, Alexander Simov, Milen Kouylekov, Krassimira Ivanova. CLaRK System: Construction of Treebanks. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 183-198.

Milena Slavcheva. Segmentation Layers in the Group of the Predicate: a Case Study of Bulgarian within the BulTreeBank Framework. In: Proc. of The First Workshop on Treebanks and Linguistic Theories (TLT2002), 20th and 21st September 2002, Sozopol, Bulgaria. pages 199-210.

The Proceedings of the Treebanks and Linguistic Theories 2002 Workshop.


Kiril Simov, Zdravko Peev, Milen Kouylekov, Alexander Simov, Marin Dimitrov, Atanas Kiryakov. CLaRK – an XML-based System for Corpora Development. In: Proc. of the Corpus Linguistics 2001 Conference, pages: 558-560.

Kiril Simov, Gergana Popova, Petya Osenova. HPSG-based syntactic treebank of Bulgarian (BulTreeBank). In: Proc. of the Corpus Linguistics 2001 Conference, 561. (abstract)

Kiril Simov, Gergana Popova, Petya Osenova. HPSG-based syntactic treebank of Bulgarian (BulTreeBank). In: “A Rainbow of Corpora: Corpus Linguistics and the Languages of the World”, edited by Andrew Wilson, Paul Rayson, and Tony McEnery; Lincom-Europa, Munich 2002. (full version) pages 135-142

Milena Slavcheva. Review of Cann, Ronnie, Claire Grover and Philip Miller, ed. (2000) Grammatical Interfaces in HPSG. Linguist List: Vol-12-1900, 12.1900.

Kiril Simov, Petya Osenova. A Hybrid System for MorphoSyntactic Disambiguation in Bulgarian. In: Proc. of the RANLP 2001 Conference, Tzigov Chark, Bulgaria, 5-7 September 2001. pages 288-290.

Kiril Simov. Grammar Extraction from an HPSG Corpus. In: Proc. of the RANLP 2001 Conference, Tzigov Chark, Bulgaria, 5-7 September 2001. pages 285-287.

Petya Osenova and Kiril Simov. Review of Minnen, Efficient Processing with Constraint-Logic Grammars Using Grammar Compilation. LINGUIST List: Vol-12-3097. Sat Dec 15 2001.

Petya Osenova. On Subject-Verb Agreement in Bulgarian (An HPSG-based account). In: Proc. of the fourth Formal Description of Slavic Languages Conference, Potsdam, Germany, 2004. pages 661-672.

Technical Reports of the BulTreeBank Project

Kiril Simov. BTB-TR01: BulTreeBank Project Overview. BulTreeBank Project Technical Report № 01. 2004

Kiril Simov and Petya Osenova. BTB-TR02: BulTreeBank Text Corpus of Bulgarian: Content, Segmentation, Tokenization. BulTreeBank Project Technical Report № 02. 2004

Kiril Simov, Petya Osenova and Milena Slavcheva. BTB-TR03: BulTreeBank Morphosyntactic Tagset. BulTreeBank Project Technical Report № 03. 2004

Kiril Simov and Petya Osenova. BTB-TR04: BulTreeBank Morphosyntactic Annotation of Bulgarian Texts. BulTreeBank Project Technical Report № 04. 2004

Petya Osenova and Kiril Simov. BTB-TR05: BulTreeBank Stylebook. BulTreeBank Project Technical Report № 05. 2004

Kiril Simov, Alexander Simov, Hristo Ganev, Milen Kouylekov, Ilko Grigorov, Krasimira Ivanova. BTB-TR06: CLaRK — an XML-based System for Corpora Development. BulTreeBank Project Technical Report № 06. 2004