Representing Sets of Entities for Matching Problems


Methods, systems, and computer-readable storage media for matching a bank statement to a super invoice. A set of column pairs is provided, each column pair including a column of a bank statement table and a column of a super invoice table, and each column pair corresponding to a modality. The super invoice table includes at least one row that includes data associated with multiple invoices. For each column pair, a feature descriptor is determined based on an operator, and a feature vector is provided based on the feature descriptors of the set of column pairs. The feature vector is input to a machine learning (ML) model that processes the feature vector to determine a probability of a match between the bank statement and a super invoice represented by the super invoice table. A binary output, representing either a match or no match between the bank statement and the super invoice, is output based on the probability.
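
The flow described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the column names, the two modalities (numeric and string), the similarity operators, the way invoices are aggregated into a super invoice row, and the logistic model with hand-set weights are all assumptions chosen for illustration.

```python
import math
from difflib import SequenceMatcher

# Hypothetical column pairs: (bank statement column, super invoice column, modality).
COLUMN_PAIRS = [
    ("amount", "total_amount", "numeric"),
    ("memo", "invoice_refs", "string"),
]


def build_super_invoice(invoices):
    """Aggregate several invoices into one super invoice row (assumed scheme)."""
    return {
        "total_amount": sum(inv["amount"] for inv in invoices),
        "invoice_refs": " ".join(inv["id"] for inv in invoices),
    }


def numeric_descriptor(a, b):
    """Assumed numeric operator: relative closeness in [0, 1], 1.0 when equal."""
    denom = max(abs(a), abs(b))
    if denom == 0:
        return 1.0
    return 1.0 - min(abs(a - b) / denom, 1.0)


def string_descriptor(a, b):
    """Assumed string operator: case-insensitive similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# One operator per modality.
OPERATORS = {"numeric": numeric_descriptor, "string": string_descriptor}


def feature_vector(bank_row, super_invoice_row):
    """Compute one feature descriptor per column pair and collect them."""
    return [
        OPERATORS[modality](bank_row[bank_col], super_invoice_row[inv_col])
        for bank_col, inv_col, modality in COLUMN_PAIRS
    ]


def match_probability(features, weights, bias):
    """Stand-in ML model: logistic regression with illustrative weights."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def classify(probability, threshold=0.5):
    """Binary output: 1 for match, 0 for no match."""
    return 1 if probability >= threshold else 0


# Illustrative weights and bias, not learned from data.
WEIGHTS, BIAS = [4.0, 4.0], -6.0

invoices = [{"id": "INV-001", "amount": 100.0}, {"id": "INV-002", "amount": 200.0}]
super_invoice = build_super_invoice(invoices)

bank_statement = {"amount": 300.0, "memo": "INV-001 INV-002"}
features = feature_vector(bank_statement, super_invoice)
probability = match_probability(features, WEIGHTS, BIAS)
decision = classify(probability)
```

In this sketch a single payment covering both invoices produces descriptors close to 1 for each column pair, so the stand-in model returns a high probability and the binary output is a match. In practice the weights would come from a trained model rather than being set by hand.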

US Patent App. 16208681