Paraphrase Search Using word2vec and the Word Mover’s Distance in Ancient Greek

Marcus Pöckelmann, Jörg Ritter, Eva Wöckener-Gade, Charlotte Schubert

Abstract


Automatic methods would be a useful aid for finding receptions of Plato’s work in ancient Greek literature. Unfortunately, such methods are often knowledge-based and thus restricted to extensively annotated texts, which are not available to a sufficient extent for ancient Greek. In this article, we describe an approach that is instead based on the distributional hypothesis, in order to overcome the problem of missing annotations. This approach uses word2vec and the related Word Mover’s Distance to identify phrases with similar meaning. Despite its experimental state, the method produces some meaningful results, as shown in three examples.
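To illustrate the idea of comparing phrases through word vectors, the following is a minimal sketch and not the authors' implementation. It computes the relaxed Word Mover's Distance, a lower bound on the full Word Mover's Distance described by Kusner et al., in which each word sends all of its weight to the nearest word of the other phrase. The two-dimensional vectors, the transliterated words, and the names `TOY_VECTORS`, `one_sided_cost`, and `relaxed_wmd` are assumptions made up for this example; real vectors would come from a word2vec model trained on a Greek corpus.

```python
import math
from collections import Counter

# Toy 2-d "embeddings", invented for illustration only. They are NOT taken
# from the article; a real setup would load vectors from a word2vec model.
TOY_VECTORS = {
    "sophia": (1.0, 0.0),
    "phronesis": (0.9, 0.1),
    "arete": (0.8, 0.3),
    "dikaiosyne": (0.7, 0.4),
    "hippos": (0.0, 1.0),
    "potamos": (0.1, 0.9),
}


def euclidean(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def one_sided_cost(source, target, vectors):
    """Cost when every source word moves all its weight to its nearest target word."""
    weights = Counter(source)          # word frequencies within the phrase
    total = sum(weights.values())      # used to normalise weights to sum to 1
    targets = set(target)
    return sum(
        (count / total) * min(euclidean(vectors[w], vectors[t]) for t in targets)
        for w, count in weights.items()
    )


def relaxed_wmd(phrase_a, phrase_b, vectors=TOY_VECTORS):
    """Relaxed Word Mover's Distance: the larger of the two one-sided costs."""
    return max(
        one_sided_cost(phrase_a, phrase_b, vectors),
        one_sided_cost(phrase_b, phrase_a, vectors),
    )


query = ["sophia", "arete"]
close = ["phronesis", "dikaiosyne"]   # hypothetical paraphrase candidate
far = ["hippos", "potamos"]           # hypothetical unrelated phrase

print(relaxed_wmd(query, close))
print(relaxed_wmd(query, far))
```

With these toy vectors, the candidate whose words lie near the query words receives the smaller distance, which is the property a paraphrase search would rank by. The exact Word Mover's Distance additionally requires solving a transport problem and is omitted here.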


Keywords


Plato; reception; paraphrases; word2vec; Word Mover’s Distance







DOI: https://doi.org/10.11588/dco.2017.0.40185

URN (PDF): http://nbn-resolving.de/urn:nbn:de:bsz:16-dco-401853
