Skip to content

A robust merging algorithm for error-prone time-dependent string data, written in Rust

License

Notifications You must be signed in to change notification settings

ulagbulag/string-combiner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

String Combiner

We have developed a robust merging algorithm for error-prone time-dependent string data. This repository provides out-of-the-box functionality for merging time-series string data. Our goal is for this to be used in string preprocessing and merging stages in real-time voice data processing pipelines, such as ASR (Automatic Speech Recognition).

use string_combiner::StringCombiner;

fn main() {
    let inputs = vec![
        "Hello World".chars(),
        "Hello worl d!".chars(),
        "내 어린시절 우연히?".chars(),
        "시찰 우연히 들었던 ".chars(),
        "우연히 들었던 믿지 못할 한 마디".chars(),
        "Hello bold".chars(),
    ];

    let combiner = StringCombiner {
        threshold_deletion_x: 3,
        ..Default::default()
    };
    let combined = combiner
        .concat_strings(inputs)
        .expect("Failed to concat texts");

    let expected = "내 어린시절 우연히 들었던 믿지 못할 한 마디";
    assert_eq!(expected, combined);
}

LICENSE

Please check our LICENSE file.

About

A robust merging algorithm for error-prone time-dependent string data, written in Rust

Resources

License

Stars

Watchers

Forks

Languages