1. Trang chủ
  2. » Công Nghệ Thông Tin

Artificial Intelligence and Natural Language

305 50 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 305
Dung lượng 13,09 MB

Nội dung

Andrey Filchenkov Lidia Pivovarova Jan Žižka (Eds.) Communications in Computer and Information Science 789 Artificial Intelligence and Natural Language 6th Conference, AINL 2017 St Petersburg, Russia, September 20–23, 2017 Revised Selected Papers 123 Communications in Computer and Information Science Commenced Publication in 2007 Founding and Former Series Editors: Alfredo Cuzzocrea, Xiaoyong Du, Orhun Kara, Ting Liu, Dominik Ślęzak, and Xiaokang Yang Editorial Board Simone Diniz Junqueira Barbosa Pontifical Catholic University of Rio de Janeiro (PUC-Rio), Rio de Janeiro, Brazil Phoebe Chen La Trobe University, Melbourne, Australia Joaquim Filipe Polytechnic Institute of Setúbal, Setúbal, Portugal Igor Kotenko St Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences, St Petersburg, Russia Krishna M Sivalingam Indian Institute of Technology Madras, Chennai, India Takashi Washio Osaka University, Osaka, Japan Junsong Yuan Nanyang Technological University, Singapore, Singapore Lizhu Zhou Tsinghua University, Beijing, China 789 More information about this series at http://www.springer.com/series/7899 Andrey Filchenkov Lidia Pivovarova Jan Žižka (Eds.) • Artificial Intelligence and Natural Language 6th Conference, AINL 2017 St Petersburg, Russia, September 20–23, 2017 Revised Selected Papers 123 Editors Andrey Filchenkov ITMO University St Petersburg Russia Jan Žižka Mendel University Brno Czech Republic Lidia Pivovarova University of Helsinki Helsinki Finland ISSN 1865-0929 ISSN 1865-0937 (electronic) Communications in Computer and Information Science ISBN 978-3-319-71745-6 ISBN 978-3-319-71746-3 (eBook) https://doi.org/10.1007/978-3-319-71746-3 Library of Congress Control Number: 2017960865 © Springer International Publishing AG 2018 This work is subject to copyright All rights are reserved by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed The use of general descriptive names, registered names, trademarks, service marks, etc in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication Neither the publisher nor the authors or the editors give a warranty, express or implied, with respect to the material contained herein or for any errors or omissions that may have been made The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations Printed on acid-free paper This Springer imprint is published by Springer Nature The registered company is Springer International Publishing AG The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland Preface The 6th Conference on Artificial Intelligence and Natural Language Conference (AINL), held during September 20–23, 2017, in Saint Petersburg, Russia, was organized by the NLP Seminar and ITMO University Its aim was to (a) bring together experts in the areas of natural language processing, speech technologies, dialogue systems, information retrieval, machine learning, artificial intelligence, and robotics and (b) to create a platform for sharing experience, extending contacts, and searching for possible collaboration Overall, the conference gathered more than 100 participants The review process was challenging Overall, 35 papers were sent to the conference and only 17 were selected, for an acceptance rate of 48% In all, 56 researchers from different domains and areas were engaged in the double-blind reviewing process Each paper received at least three reviews, in many cases there were four reviews Beyond regular papers, the proceedings contain six papers about the Russian Paraphrase Detection shared task, which took place at the AINL 2016 conference These papers followed a slightly different review process and were not anonymized for reviews Altogether, 17 papers were presented at the conference, covering a wide range of topics, including social data analysis, dialogue systems, speech processing, information extraction, Web-scale data processing, word embedding, topic modeling, and transfer learning Most of the presented papers were devoted to analyzing human communication and creating algorithms to perform such analysis In addition, the conference program included several special talks and events, including tutorials on neural machine translation, deception detection in language, a hackathon for plagiarism detection in Russian texts, an invited talk on the shape of the future of computational science, industry talks and demos, and a poster session Many thanks to everybody who submitted papers and gave wonderful talks, and to whose who came and participated without publication We are indebted to our Program Committee members for their detailed and insightful reviews; we received very positive feedback from our authors even from those whose submissions were rejected And last but not the least, we are grateful to our organization team: Anastasia Bodrova, Irina Krylova, Aleksandr Bugrovsky, Natalia Khanzhina, Ksenia Buraya, and Dmitry Granovsky November 2017 Andrey Filchenkov Lidia Pivovarova Jan Žižka Organization Program Committee Jan Žižka (Chair) Jalel Akaichi Mikhail Alexandrov Artem Andreev Artur Azarov Alexandra Balahur Siddhartha Bhattacharyya Svetlana Bichineva Victor Bocharov Elena Bolshakova Pavel Braslavski Maxim Buzdalov John Cardiff Dmitry Chalyy Daniil Chivilikhin Dan Cristea Frantisek Darena Gianluca Demartini Marianna Demenkova Dmitry Granovsky Maria Eskevich Vera Evdokimova Alexandr Farseev Andrey Filchenkov Tatjana Gornostaja Mark Granroth-Wilding Jiří Hroza Tomáš Hudík Camelia Ignat Denis Kirjanov Goran Klepac Daniil Kocharov Artemy Kotov Miroslav Kubat Andrey Kutuzov Nikola Ljubešić Mendel University of Brno, Czech Republic King Khalid University, Tunisia Autonomous University of Barcelona, Spain Russian Academy of Science, Russia Saint Petersburg Institute for Informatics and Automation, Russia European Commission, Joint Research Centre, Ispra, Italy RCC Institute of Information Technology, India Saint Petersburg State University, Russia OpenCorpora, Russia Moscow State Lomonosov University, Russia Ural Federal University, Russia ITMO University, Russia Institute of Technology Tallaght, Dublin, Ireland Yaroslavl State University, Russia ITMO University, Russia A I Cuza University of Iasi, Romania Mendel University in Brno, Czech Republic University of Sheffield, UK Kefir Digital, Russia Yandex, Russia Radboud University, The Netherlands Saint Petersburg State University, Russia Singapore National University, Singapore ITMO University, Russia Tilde, Latvia University of Helsinki, Finland Rare Technologies, Czech Republic Think Big Analytics, Czech Republic Joint Research Centre of the European Commission, Ispra, Italy Higher School of Economics, Russia University of Zagreb, Croatia Saint Petersburg State University, Russia Kurchatov Institute, Russia University of Miami, FL, USA University of Oslo, Norway Jožef Stefan Institute, Slovenia VIII Organization Natalia Loukachevitch Kirill Maslinsky Vladislav Maraev George Mikros Alexander Molchanov Sergey Nikolenko Alexander Panchenko Allan Payne Jakub Piskorski Lidia Pivovarova Ekaterina Protopopova Paolo Rosso Eugen Ruppert Ivan Samborskii Arun Kumar Sangaiah Christin Seifert Serge Sharoff Jan Šnajder Maria Stepanova Hristo Tanev Irina Temnikova Michael Thelwall Alexander Troussov Vladimir Ulyantsev Dmitry Ustalov Natalia Vassilieva Mikhail Vink Wajdi Zaghouani Moscow State University, Russia National Research University Higher School of Economics, Russia University of Gothenburg, Sweden National and Kapodistrian University of Athens, Greece PROMT, Russia Steklov Mathematical Institute, St Petersburg, Russia Universität Hamburg, Germany American University in London, UK Joint Research Centre of the European Commission, Ispra, Italy University of Helsinki, Finland Saint Petersburg State University, Russia Technical University of Valencia, Spain TU Darmstadt - FG Language Technology, Germany Singapore National University, Singapore VIT University, Tamil Nadu, India University of Passau, Germany University of Leeds, UK University of Zagreb, Croatia ABBYY, Russia Joint Research Centre of the European Commission, Ispra, Italy Qatar Computing Research Institute, Qatar University of Wolverhampton, UK Russian Presidential Academy of National Economy and Public Administration, Russia ITMO University, Russia Lappeenranta University of Technology, Finland Hewlett Packard Labs, USA JetBrains, Germany Carnegie Mellon University Qatar Contents Social Interaction Analysis Semantic Feature Aggregation for Gender Identification in Russian Facebook Polina Panicheva, Aliia Mirzagitova, and Yanina Ledovaya Using Linguistic Activity in Social Networks to Predict and Interpret Dark Psychological Traits Arseny Moskvichev, Marina Dubova, Sergey Menshov, and Andrey Filchenkov Boosting a Rule-Based Chatbot Using Statistics and User Satisfaction Ratings Octavia Efraim, Vladislav Maraev, and João Rodrigues 16 27 Speech Processing Deep Learning for Acoustic Addressee Detection in Spoken Dialogue Systems Aleksei Pugachev, Oleg Akhtiamov, Alexey Karpov, and Wolfgang Minker Deep Neural Networks in Russian Speech Recognition Nikita Markovnikov, Irina Kipyatkova, Alexey Karpov, and Andrey Filchenkov Combined Feature Representation for Emotion Classification from Russian Speech Oxana Verkholyak and Alexey Karpov 45 54 68 Information Extraction Active Learning with Adaptive Density Weighted Sampling for Information Extraction from Scientific Papers Roman Suvorov, Artem Shelmanov, and Ivan Smirnov 77 Application of a Hybrid Bi-LSTM-CRF Model to the Task of Russian Named Entity Recognition The Anh Le, Mikhail Y Arkhipov, and Mikhail S Burtsev 91 X Contents Web-Scale Data Processing Employing Wikipedia Data for Coreference Resolution in Russian Ilya Azerkovich 107 Building Wordnet for Russian Language from Ru.Wiktionary Yuliya Chernobay 113 Corpus of Syntactic Co-Occurrences: A Delayed Promise Eduard S Klyshinsky and Natalia Y Lukashevich 121 Computation Morphology and Word Embeddings A Close Look at Russian Morphological Parsers: Which One Is the Best? Evgeny Kotelnikov, Elena Razova, and Irina Fishcheva 131 Morpheme Level Word Embedding Ruslan Galinsky, Tatiana Kovalenko, Julia Yakovleva, and Andrey Filchenkov 143 Comparison of Vector Space Representations of Documents for the Task of Information Retrieval of Massive Open Online Courses Julius Klenin, Dmitry Botov, and Yuri Dmitrin 156 Machine Learning Interpretable Probabilistic Embeddings: Bridging the Gap Between Topic Models and Neural Networks Anna Potapenko, Artem Popov, and Konstantin Vorontsov Multi-objective Topic Modeling for Exploratory Search in Tech News Anastasia Ianina, Lev Golitsyn, and Konstantin Vorontsov A Deep Forest for Transductive Transfer Learning by Using a Consensus Measure Lev V Utkin and Mikhail A Ryabinin 167 181 194 Russian Paraphrase Detection Shared Task ParaPhraser: Russian Paraphrase Corpus and Shared Task Lidia Pivovarova, Ekaterina Pronoza, Elena Yagunova, and Anton Pronoza Effect of Semantic Parsing Depth on the Identification of Paraphrases in Russian Texts Kirill Boyarsky and Eugeni Kanevsky 211 226 ... this series at http://www.springer.com/series/7899 Andrey Filchenkov Lidia Pivovarova Jan Žižka (Eds.) • Artificial Intelligence and Natural Language 6th Conference, AINL 2017 St Petersburg, Russia,... laws and regulations and therefore free for general use The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate... registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland Preface The 6th Conference on Artificial Intelligence and Natural Language Conference (AINL), held during September 20–23, 2017,

Ngày đăng: 29/12/2020, 16:06

Nguồn tham khảo

Tài liệu tham khảo Loại Chi tiết
3. Nenkova, A., McKeown, K.: A survey of text summarization techniques. In: Aggar- wal, C., Zhai, C. (eds.) Mining Text Data Book, pp. 43–76. Springer, Boston (2012).https://doi.org/10.1007/978-1-4614-3223-4 3 Link
17. Kozareva, Z., Montoyo, A.: Paraphrase identification on the basis of supervised machine learning techniques. In: Salakoski, T., Ginter, F., Pyysalo, S., Pahikkala, T. (eds.) FinTAL 2006. LNCS (LNAI), vol. 4139, pp. 524–533. Springer, Heidelberg (2006). https://doi.org/10.1007/11816508 52 Link
28. Guarino, N.: The ontological level: revisiting 30 years of knowledge representa- tion. In: Borgida, A.T., Chaudhri, V.K., Giorgini, P., Yu, E.S. (eds.) Conceptual Modeling: Foundations and Applications. LNCS, vol. 5600, pp. 52–67. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-02463-4 4 Link
1. Fader, A., Zettlemoyer, L.S., Etzioni, O.: Paraphrase-driven learning for open ques- tion answering. In: Proceedings of ACL-2013, pp. 1608–1618 (2013) Khác
2. Vossen, P., Rigau, G., Serafini, L., Stouten, P., Irving, F., van Hage, W.R.: News- Reader: recording history from daily news streams. In: Proceedings of LREC-2014, pp. 2000–2007 (2014) Khác
4. Loukachevitch, N., Alekseev, A.: Summarizing news clusters on the basis of the- matic chains. In: Proceedings of LREC-2012, pp. 1600–1607 (2012) Khác
5. Clough, P., Gaizauskas, R., Piao, S., Wilks, Y.: METER: MEasuring TExt reuse.In: Proceedings of the 40th Anniversary Meeting for the Association for Compu- tational Linguistics (ACL 2002), pp. 152–159 (2002) Khác
6. Marton, Y., Callison-Burch, C., Resnik, P.: Improved statistical machine transla- tion using monolingually-derived paraphrases. In: Proceedings of the 2009 Confer- ence on Empirical Methods in Natural Language Processing, EMNLP-2009, pp.381–390 (2009) Khác
7. Dolan, W.B., Quirk, C., Brockett, C.: Unsupervised construction of large para- phrase corpora: exploiting massively parallel news sources. In: Proceedings of the 20th International Conference on Computational Linguistics, Coling-2004, Geneva, Switzerland (2004) Khác
10. Agirre, E., Banea, C., Cer, D., Diab, M., Gonzalez-Agirre, A., Mihalcea, R., Wiebe, J.: Semeval-2016 task 1: semantic textual similarity, monolingual and cross-lingual evaluation. In: Proceedings of SemEval, pp. 497–511 (2016) Khác
12. Han, L., Kashyap, A., Finin, T., Mayfield, J., Weese, J.: UMBC EBIQUITY- CORE: semantic textual similarity systems. In: Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 1: Proceedings of the Main Con- ference and the Shared Task: Semantic Textual Similarity, Atlanta, Georgia, USA, June, pp. 44–52. Association for Computational Linguistics (2013) Khác
13. Loukachevitch, N., Dobrov, B.: RuThes linguistic ontology vs. Russian wordnets.In: Proceedings of Global WordNet Conference GWC-2014, pp. 154–162 (2014) 14. Pronoza, E., Yagunova, E., Pronoza, A.: Construction of a Russian paraphrase cor-pus: unsupervised paraphrase extraction. In: Braslavski, P., Markov, I., Pardalos, P., Volkovich, Y., Ignatov, D.I., Koltsov, S., Koltsova, O. (eds.) RuSSIR 2015 Khác
15. Pivovarova, L., Pronoza, E., Yagunova, E., Pronoza, A.: ParaPhraser: Russian paraphrase corpus and shared task. In: Filchenkov, A., et al. (eds.) AINL 2017.CCIS, vol. 789, pp. 211–225. Springer, Cham (2018) Khác
16. Loukachevitch, N., Shevelev, A., Mozharova V.: Testing features and methods in Russian Paraphrasing Task. In: Proceedings of International Conference on Com- putational Linguistics and Intellectual Technologies Dialog 2017, vol. 1, pp. 135–145 (2017) Khác
18. Pronoza, E., Yagunova, E.: Low-level features for paraphrase identification. In:Sidorov, G., Galicia-Haro, S.N. (eds.) MICAI 2015. LNCS (LNAI), vol. 9413, pp Khác
20. Mihalcea, R., Corley, C., Strapparava C.: Corpus-based and Knowledge-based mea- sures of text semantic similarity. In: Proceedings of the American Association for Artificial Intelligence (2006) Khác
21. Fernando, S., Stevenson, M.: A semantic similarity approach to paraphrase detec- tion. In: Proceedings of the 11th Annual Research Colloquium of the UK Special Interest Group for Computational Linguistics, pp. 45–52 (2008) Khác
22. Bar, D., Biemann, C., Gurevych, I., Zesch, T.: UKP: computing semantic textual similarity by combining multiple content similarity measures. In: Proceedings of the 6th International Workshop on Semantic Evaluation, Held in Conjunction with the 1st Joint Conference on Lexical and Computational Semantics, pp. 435–440 (2012) Khác
23. Rychalska, B., Pakulska, K., Chodorowska, K., Walczak, W., Andruszkiewicz, P.:Samsung Poland NLP team at SemEval-2016 Task 1: necessity for diversity; com- bining recursive autoencoders, wordnet and ensemble methods to measure semantic similarity. In: Proceedings of the 10th International Workshop on Semantic Eval- uation (SemEval 2016), San Diego, CA, USA (2016) Khác
24. Gurevych, I., Niederlich, H.: Computing semantic relatedness in German with revised information content metrics. In: Proceedings of OntoLex 2005 - Ontolo- gies and Lexical Resources, IJCNLP 2005 Workshop (2005) Khác

TỪ KHÓA LIÊN QUAN

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN