fastchat.data.clean_sharegpt
Convert html to markdown with basic data cleaning.
Deduplication.
Usage: python3 -m fastchat.data.clean_sharegpt –in sharegpt_html.json –out sharegpt_clean.json
Module Contents
Functions
|
Clean the source html files. |