data-utils

Use this repository to easily convert your source files (csv, txt, excel, json, html) into record-oriented JSON files that can be uploaded into onetask.

Installation

Clone this repository, then install the dependencies into an environment (e.g. Conda) by running pip install -r requirements.txt

How to use

This tool is CLI-based: open a terminal, change into the repository directory, and run python json_converter.py --filename <your_filename>

For instance, suppose you have a file my_input_file.csv with the following data:

running_id,text
1,hello world!
2,how are you doing?
3,if you have any questions about onetask - contact us anytime

Running python json_converter.py --filename my_input_file.csv will then output:

[
    {
        "running_id": 1,
        "text": "hello world!"
    },
    {
        "running_id": 2,
        "text": "how are you doing?"
    },
    {
        "running_id": 3,
        "text": "if you have any questions about onetask - contact us anytime"
    }
]
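
The conversion above can be approximated with the Python standard library alone. This is a minimal sketch, not the repository's actual implementation (the pandas-style arguments suggest the tool itself is built on pandas). The function name csv_to_records and the rule that integer-looking values become integers are assumptions made for illustration.

```python
import csv
import io
import json


def csv_to_records(text, sep=","):
    """Turn CSV text into a list of dicts, one dict per data row."""
    reader = csv.DictReader(io.StringIO(text), delimiter=sep)
    records = []
    for row in reader:
        record = {}
        for key, value in row.items():
            # Assumption: values that parse as integers become ints,
            # everything else is kept as a string.
            try:
                record[key] = int(value)
            except (TypeError, ValueError):
                record[key] = value
        records.append(record)
    return records


sample = (
    "running_id,text\n"
    "1,hello world!\n"
    "2,how are you doing?\n"
)
records = csv_to_records(sample)

# Serialize as record-oriented JSON, indented like the output above.
print(json.dumps(records, indent=4))
```

Each row becomes one JSON object keyed by the header names, which is what "record-oriented" refers to.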

You can configure the conversion by passing additional Pandas-style arguments. For example, to set the delimiter of a CSV file, run python json_converter.py --filename my_input_file.csv --sep ';'
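
One way a CLI could collect such extra arguments is sketched below. This is an illustration only and may differ from how json_converter.py handles them; the function name parse_cli and the assumption that every extra option comes as a "--name value" pair are hypothetical.

```python
import argparse


def parse_cli(argv):
    """Split argv into the filename and a dict of extra reader options."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--filename", required=True)
    # parse_known_args returns the unrecognized arguments instead of failing.
    known, extra = parser.parse_known_args(argv)

    # Assumption: extra options always come as "--name value" pairs.
    if len(extra) % 2 != 0:
        raise ValueError("expected extra options as --name value pairs")
    options = {}
    for name, value in zip(extra[0::2], extra[1::2]):
        options[name.lstrip("-")] = value
    return known.filename, options


filename, options = parse_cli(
    ["--filename", "my_input_file.csv", "--sep", ";"]
)
print(filename, options)
```

The resulting dictionary could then be forwarded as keyword arguments to a reader, for instance pandas.read_csv(filename, **options).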

We currently support:

  • Excel (supports xls, xlsx, xlsm, xlsb, odf, ods and odt file extensions read from a local filesystem or URL)
  • CSV (also works for .txt files)
  • HTML
  • JSON
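
A tool supporting the formats listed above needs some way to choose a reader for each file. The sketch below shows one possible approach, selecting by file extension; the mapping and the function name pick_reader are assumptions for illustration, and the values are the names of the pandas reader functions for each format.

```python
from pathlib import Path

# Assumption: this mapping mirrors the supported formats listed above.
# The values name the pandas function that reads each format.
READERS = {
    ".xls": "read_excel",
    ".xlsx": "read_excel",
    ".xlsm": "read_excel",
    ".xlsb": "read_excel",
    ".odf": "read_excel",
    ".ods": "read_excel",
    ".odt": "read_excel",
    ".csv": "read_csv",
    ".txt": "read_csv",
    ".html": "read_html",
    ".json": "read_json",
}


def pick_reader(filename):
    """Return the name of the pandas reader for this file's extension."""
    suffix = Path(filename).suffix.lower()
    if suffix not in READERS:
        raise ValueError(f"unsupported file type: {suffix or filename}")
    return READERS[suffix]


print(pick_reader("my_input_file.csv"))
```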

Support

If you have any questions or run into issues, feel free to contact us anytime.
