Inference with VLM

Installations

Clone project

git clone https://github.com/convince-project/sit-aw-aip.git

⚠ At this stage ignore vLLM-hosting folder

Build your project - Once

uv sync --frozen

Activate virtual env - everytime you enter the project

source .venv/bin/activate

Generate the right data format

This script is in charge of formatting the data in accordance to what is expected by the model and given some required input and data structure.

formatData \
--use_case_id {id} \
--root_path {root_path}

Variables

use_case_id : 1,2 or 3 given CONVINCE Use cases order

root_path : root_path to all anomalies data, the folders are structured the following way given the use case; only the required fields have to be present before hand :

UC1

-- root
-- Anomaly 1
    -- csv_images_files (will be generated)
            angular_imu_velocity.png
            base_current_velocity.png
            odom_vel.png
            trajectory.png
    -- images (will be generated)
            [all images files]
    -- text_files (will be generated)
            class_action.txt
    -- video (will be generated)
            video.mp4
    ros_file.mcap (required! with this extension!)
-- Anomaly 2 (same as 1)
-- Anomaly 3 (same as 1)
-- (repeat)

UC2

-- root
-- Anomaly 1
    -- block 1
        -- folder 1 (required!)
                chest_cam_video.mp4
                proprioception.csv
        -- folder 2 (required!)
                scan_image.png
        -- csv_images_files (will be generated)
                graph_image_csv_images_files.png
        -- video (will be generated)
                video.mp4
        -- images (will be generated)
                [all images files]
    -- block 2 (same as block1)
    -- block 3 (same as block1)
    -- (repeat)
-- Anomaly 2 (same as Anomaly 1)
-- (repeat)

Csv file columns and representation (elements in brackets represent numbers) - please refer for your data structure

timestamp	name	position
{timestamp_0}	gripper_jaws	{value_0}

Other names in the name column can be present, but the gripper_jaws has to be.

Example given uc2 previously presented data structre :

formatData \
--use_case_id 2 \
--root_path home/root/

Send an identification request to the VLM

If you prefer to use a local VLM - works only with our chosen model

inference_local \
--use_case_id {id} \
--anomaly_case_path {root_path_to_one_anomaly_case}

If you prefer the hosted VLM

inference_server \
--use_case_id {id} \
--anomaly_case_path {root_path_to_one_anomaly_case}

Hosted VLM variables

There are three environment variables defined in the .env at the root. The SERVER_IP variable need to be changed to the IP of the distant machine where the model is hosted, else it will consider localhost and result in error. The two other variables MODEL and PORT have to correspond with the ones defined when deploying the model.

Shared variables

use_case_id : 1,2 or 3 given the use case you want to treat within CONVINCE use case.

anomaly_case_path: within the selected use case and the formatted data, the root_path to the desired anomaly to treat, where all folders are.

Example given uc1 previously presented data structre :

inference_local \
--use_case_id 1 \
--anomaly_case_path home/root/Anomaly\1

Check with UC2 samples

Some samples are provided to you in the examples folder, which represents the root.

Start by formatting the data:

formatData \
--use_case_id 1 \
--root_path examples/

Then, execute an inference with either the local or deployed model, by selecting one block folder:

local model:

inference_local \
--use_case_id 2 \
--anomaly_case_path examples/AN01/a58_00_2025_02_25_09_07_59

deployed model:

inference_server \
--use_case_id 2 \
--anomaly_case_path examples/AN01/a58_00_2025_02_25_09_07_59