Publications
An Alignment Cost-Based Classification of Log Traces Using Machine-Learning
Conformance checking is an important aspect of process mining that identifies the differences between the behaviors recorded in a log and those exhibited by an associated process model. Machine learning and deep learning methods perform extremely well in sequence analysis. We successfully apply both a Recurrent Neural Network and a Random Forest classifier to the problem of evaluating whether the alignment cost of a log trace to a process model is below an arbitrary threshold, and provide a lower bound for the fitness of the process model based on the classification.
Publications
Do Sentence Embeddings Capture Discourse Properties of Sentences from Scientific Abstracts
We introduce four tasks designed to determine which sentence encoders best capture discourse properties of sentences from scientific abstracts, namely coherence between clauses of a sentence, and discourse relations within sentences. We show that even if contextual encoders such as BERT or SciBERT encodes the coherence in discourse units, they do not help to predict three discourse relations commonly used in scientific abstracts. We discuss what these results underline, namely that these discourse relations are based on particular phrasing that allow non-contextual encoders to perform well.
Publications
Alignement de Structures Argumentatives et Discursives par Fouille de Graphes et de Redescriptions
Dans cet article, nous étudions la similarité entre structures argumentatives et discursives en alignant des sous-arbres dans un corpus annoté en RST et en structure argumentative. Contrairement aux travaux précédents, nous nenous intéressons pas uniquement à un alignement relation à relation, mais à unalignement de sous-structures. À l’aide de méthodes de fouille de données, nous montrons que des similitudes existent entre l’argumentation et le discours. L’annotation multiple du corpus permet également de proposer un alignement entreles structures.
Publications
Aligning Discourse and Argumentation Structures using Subtrees and Redescription Mining
In this paper, we investigate similarities between discourse and argumentation structures by aligning subtrees in a corpus containing both annotations. Contrary to previous works, we focus on comparing sub-structures and not only relation matches. Using data mining techniques , we show that discourse and argumen-tation most often align well, and the double annotation allows to derive a mapping between structures. Moreover, this approach enables the study of similarities between discourse structures and differences in their expressive power.
Publications
AOC-poset of discourse and argumentation: what can we learn on their dependencies?
We aim at finding and understanding dependencies between linguistic structures which differ in terms of constraints and expressive power. It has been shown that studying dependencies between the argu- mentation structure (ARG) and the Rhetorical Structure Theory (RST) is non-trivial and requires a fine methodology. In this paper, we propose to take advantage of the AOC-Poset structure to understand how the subgraphs alignements occur in a small corpus annotated in ARG and RST.