The table below lists the status and error messages associated with jobs.
Job Status
Description
Next Steps
Complete
Job has been successfully processed.
No next steps.
Incomplete
Job is still being processed.
Check the logs of the container that correspond with the service owner of the job. If a container has stopped running, restart that container. Then call the retry endpoint for the service. To learn about retry endpoints, see Retry Logic.
No Pipeline Found
No tasks have a job selector that matches with the provided image.
Ensure that the desired task has the appropriate job selector. To modify a task in the Web UI, follow the instructions for Update Task.
Pipeline Error
The pipeline processing the image failed.
Check the job Error Details column or QC flags field for details about the error. See View Jobs in Web UI. Also, check the log for the pipeline container. If the error is permissions-based, the logs may say permission denied or failed to write file. In this case, on the Ubuntu system, run: make files or sudo make files.
Transmission Error
The job could not be transferred to the message bus.
Check the logs of the service in the Error Details column. See View Jobs in Web UI. If necessary, call the retry endpoint for the appropriate service, restart the pipeline and/or restart the Redis container. To learn about retry endpoints, see Retry Logic.
File Error (Input)
The input file could not be transferred from the File Sender OEM to the File Receiver Gateway.
Check the logs of the File Sender OEM. If the owner of the job is still the File Sender OEM, then call the retry endpoint using instructions in Retry Logic.
File Error (Output)
The output file(s) could not be transferred from the File Sender Gateway to the File Receiver OEM.
To learn how to check the error details for the failing output file(s), See Output File Status Info. Check the logs of the File Receiver OEM and the File Sender Gateway for the job. If the owner of the job is still the File Receiver OEM, call the retry endpoint for the File Sender Gateway and File Receiver OEM using the instructions in Retry Logic.
See the columns Job Status, Pipeline Status and Error Details, which displays detailed information about the error.
Compare the contents of these columns with the Job Status table above.
Example
The Error Details column presents with this format: (<service name>): <error>.
The <service name> is the service to investigate.
The <error> is the cause of the failure.
Example: (data-organizer): no tasks could be matched to the input file name
For information about how to view a container's logs, see Check Logs.
Check the Error Details column to determine the service that failed to process the file. From
there, trace the logs back to see the context of the error. If necessary, restart the affected containers or call
the appropriate retry endpoints. See Retry Logic.
Any time the ssh keys are regenerated on a two system setup, the public key must be copied back from the Gateway System to the OEM system.
Once the key is copied over tho the OEM, the OEM services must be rebuilt in order to have a copy of the new key.
If any of the OEM services (ie app-file-watcher) have an error that looks like:
level=INFOts=2023-06-13T19:10:25.315559848Zapp=app-file-watchersource=methods.go:138msg="using Unix Domain Socket at unix:///tmp/edgex/secrets/spiffe/public/api.sock"
then the services may have hung.
Here are some potential fixes for this issue:
Check that systems are on the same network by verifying that the Gateway system can ping the IP address of the OEM system.
Check that the public key edgex-res/id_rsa.pub on the Gateway and edgex-res/remote/sshd-remote/authorized_keys on the OEM file are the same.
Verify that the TUNNEL_HOST: <OEM_IP_ADDRESS> in the docker-compose-edgex-spiffe-spire.yml address matches the OEM system.
Ensure that the logs for both the oem-ssh-proxy on the Gateway and the remote-spire-agent on the OEM have the appropriate logs as described in System Setup
If you try to run the golang tools like the make auto-verify-clients or make client-update and run into mockery: command not found, there is most likely an issue with environment variables. Similar issue - go-test-report: not found can occur when running make test-report.
The fix can be found below:
Note
You will need sudo privileges to implement this fix.