Natalia Analytics troubleshooting
Operational guide for the most frequent errors. Every entry maps to an error code documented in the complete reference. Case not listed? Contact [email protected].
Ports & firewall matrix
Reference for the network team. The agent VM never opens an inbound port. All traffic is outbound, initiated by the agent.
| Source | Destination | Port | Protocol | Direction | Mandatory? |
|---|---|---|---|---|---|
| Agent VM | PBX OXE | 22 | TCP / SSH | Outbound | YES (OXE mode) |
| Agent VM | PBX OXO | 443 or 30443 | TCP / HTTPS | Outbound | YES (OXO mode) |
| Agent VM | api.getnatalia.com | 443 | TCP / HTTPS | Outbound | YES |
| No inbound port required on the Agent VM | — | — | — | Security best practice | |
→ Full threat model in the security architecture.
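To confirm the matrix above from the agent VM's side, a short outbound TCP probe can help. The following is a minimal sketch, not part of the Natalia agent: the function names `can_connect` and `check_matrix` are hypothetical, and the PBX addresses in the commented example are placeholders to replace with your own.

```python
import socket
from typing import Dict, Tuple


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if an outbound TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS failures.
        return False


def check_matrix(targets: Dict[str, Tuple[str, int]],
                 timeout: float = 3.0) -> Dict[str, bool]:
    """Probe every (host, port) pair and report reachability per label."""
    return {
        label: can_connect(host, port, timeout)
        for label, (host, port) in targets.items()
    }


# Example (placeholder PBX addresses, adjust to your site):
# check_matrix({
#     "OXE SSH": ("192.0.2.10", 22),
#     "OXO HTTPS": ("192.0.2.20", 443),
#     "Natalia API": ("api.getnatalia.com", 443),
# })
```

A `False` result only shows that the TCP handshake failed. It does not distinguish a firewall drop from a service that is down, so pair it with the error tables in this guide.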
Operational errors by code (NAT-AGT-001 → 010)
Quick-fix table. Full reference (API codes included): error codes.
| Code | Symptom | Cause | Resolution |
|---|---|---|---|
| NAT-AGT-001 | SCP timeout to the PBX | Network split or firewall blocking 22 | Auto retry (exponential backoff). Check firewall. |
| NAT-AGT-002 | Taxation license missing on OXE | spadmin -l does not list the account package | Contact your Alcatel-Lucent integrator |
| NAT-AGT-003 | SCP Permission denied | /etc/shells or ACL on /usr4/account/ | setfacl -R -m u:natalia-collector:r /usr4/account/ |
| NAT-AGT-004 | SSH connection refused | SSH not enabled on the OXE | netadmin -m → Security → SSH |
| NAT-AGT-005 | /usr4/account path not found | Non-standard OXE install | find /usr -name "TAXA*.DAT" |
| NAT-AGT-006 | OXO JWT expired | Polling interval > 60 min (JWT TTL) | Set polling to 30 min |
| NAT-AGT-007 | Disk buffer > 80% | API unreachable or polling too frequent | Wait for API recovery, or drain queue |
| NAT-AGT-008 | Disk buffer > 95% — polling PAUSED | Cloud API unreachable ~3 days | Restore API, polling resumes automatically |
| NAT-AGT-009 | Admin password lost | vTPM absent on the host | Redeploy a fresh OVA (irrecoverable by design) |
| NAT-AGT-010 | CDR parser failed | .DAT not decompressed before parsing | gunzip on the agent, requeue file |
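For NAT-AGT-010, the check behind "gunzip, then requeue" can be sketched as follows. This assumes the compressed .DAT files are gzip streams (which the gunzip fix implies) and detects them by the gzip magic bytes. The helper names and the `.raw` output suffix are hypothetical, not the agent's actual behavior.

```python
import gzip
import shutil
from pathlib import Path

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream


def is_gzip(path: Path) -> bool:
    """Return True if the file starts with the gzip magic bytes."""
    with path.open("rb") as fh:
        return fh.read(2) == GZIP_MAGIC


def ensure_decompressed(path: Path) -> Path:
    """Return a path to uncompressed content.

    If the file is gzip-compressed, write the decompressed bytes next to it
    (hypothetical ".raw" suffix) and return that new path; otherwise return
    the original path unchanged.
    """
    if not is_gzip(path):
        return path
    out = path.with_name(path.name + ".raw")
    with gzip.open(path, "rb") as src, out.open("wb") as dst:
        shutil.copyfileobj(src, dst)
    return out
```

The original file is left in place, so a failed parse can be retried without fetching the file from the PBX again.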
OXE SCP errors — raw shell messages
Reverse lookup: shell error message to root cause and fix. Useful when you reproduce the SCP call manually from the agent VM (scp natalia-collector@<OXE-IP>:/usr4/account/...).
| Shell error message | Root cause | Resolution |
|---|---|---|
| ssh: connect to host ... port 22: Connection refused | SSH not enabled on OXE | netadmin -m → Security → SSH → enable |
| Permission denied (publickey,password) | Wrong login/password | Check /etc/passwd + trusted hosts |
| scp: No such file or directory | Taxation license missing (no .DAT files produced) | spadmin -l \| grep -i account |
| scp: ... Permission denied | Restrictive ACL on /usr4/account/ | setfacl -R -m u:natalia-collector:r /usr4/account/ |
| ssh_exchange_identification: Connection closed | Agent IP not in /etc/hosts.allow on OXE | Add agent IP via netadmin |
| Warning: Remote host identification has changed | OXE was replaced / rekeyed | ssh-keygen -R <OXE-IP> on agent VM |
| Connection OK but 0 .DAT files | Accounting not configured in OMC | Verify Accounting setup in OMC |
| .DAT file not parseable | Compressed binary read as CSV | file TAXA*.DAT then gunzip |
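When reproducing the SCP call from a script, the reverse lookup above can be automated by matching stderr against the known messages. The sketch below is illustrative only: `RULES` and `diagnose` are hypothetical names, the regular expressions are approximations of the messages in the table, and the two non-shell rows (0 .DAT files, unparseable .DAT) are left out because they produce no stderr text.

```python
import re
from typing import Optional, Tuple

# (pattern, root cause, resolution), transcribed from the table above.
RULES = [
    (r"port 22: Connection refused",
     "SSH not enabled on OXE",
     "netadmin -m → Security → SSH → enable"),
    (r"Permission denied \(publickey,password\)",
     "Wrong login/password",
     "Check /etc/passwd + trusted hosts"),
    (r"scp: .*Permission denied",
     "Restrictive ACL on /usr4/account/",
     "setfacl -R -m u:natalia-collector:r /usr4/account/"),
    (r"No such file or directory",
     "Taxation license missing (no .DAT files produced)",
     "spadmin -l | grep -i account"),
    (r"ssh_exchange_identification: Connection closed",
     "Agent IP not in /etc/hosts.allow on OXE",
     "Add agent IP via netadmin"),
    (r"remote host identification has changed",
     "OXE was replaced / rekeyed",
     "ssh-keygen -R <OXE-IP> on agent VM"),
]


def diagnose(stderr: str) -> Optional[Tuple[str, str]]:
    """Return (root cause, resolution) for the first recognized line."""
    for line in stderr.splitlines():
        for pattern, cause, fix in RULES:
            # Case-insensitive: OpenSSH prints some warnings in upper case.
            if re.search(pattern, line, re.IGNORECASE):
                return cause, fix
    return None
```

For example, `diagnose("ssh: connect to host 10.0.0.5 port 22: Connection refused")` returns the "SSH not enabled on OXE" row, and unrecognized output returns `None` so it can be escalated to support.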
Buffer & retention behavior
Agent behavior when the Natalia Cloud API is down for hours/days
The agent buffers up to 3 days of data on disk and retries with exponential backoff. The web UI shows a "Sync delayed" badge with the queued file count. Past 3 days, the PBX still holds its CDRs (OXE keeps 31 days in /usr4/account/), so no data is lost even in the worst case.
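The retry behavior described above can be pictured with a small sketch. Only the 3-day window comes from this guide. The base delay, growth factor and cap are assumptions chosen for illustration, and `backoff_delays` and `upload_with_retry` are hypothetical names, not the agent's internals.

```python
import time
from typing import Callable, Iterator

BUFFER_WINDOW_S = 3 * 24 * 3600  # 3-day buffer window stated in this guide


def backoff_delays(base: float = 30.0, factor: float = 2.0,
                   cap: float = 3600.0) -> Iterator[float]:
    """Yield delays that grow geometrically until they reach the cap.

    base, factor and cap are assumed values, not documented agent settings.
    """
    delay = base
    while True:
        yield min(delay, cap)
        delay = min(delay * factor, cap)


def upload_with_retry(send: Callable[[], bool],
                      sleep: Callable[[float], None] = time.sleep,
                      window: float = BUFFER_WINDOW_S) -> bool:
    """Call send() until it succeeds or the next wait would exceed the window."""
    waited = 0.0
    for delay in backoff_delays():
        if send():
            return True
        if waited + delay > window:
            return False  # give up; the PBX retention covers the gap
        sleep(delay)
        waited += delay
    return False  # unreachable: backoff_delays() never ends
```

With these assumed defaults the first delays are 30, 60, 120, 240, 480, 960 and 1920 seconds, after which every retry waits the capped 3600 seconds.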
Agent behavior when its own buffer fills up
Above 80% disk (NAT-AGT-007), an email alert fires. Above 95% (NAT-AGT-008), polling stops to avoid corruption. The PBX retention takes over until connectivity is restored.
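The two thresholds above map naturally onto a small state check. This is a minimal sketch under the assumption that "disk" means the filesystem holding the buffer; the function names are hypothetical, and the alerting and pause actions themselves are not shown.

```python
import shutil

WARN_PCT = 80.0   # above this: NAT-AGT-007, email alert
PAUSE_PCT = 95.0  # above this: NAT-AGT-008, polling paused


def disk_used_pct(path: str = ".") -> float:
    """Return the used percentage of the filesystem containing path."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


def buffer_state(used_pct: float) -> str:
    """Map a used-disk percentage to the matching error code, or "OK"."""
    if used_pct > PAUSE_PCT:
        return "NAT-AGT-008"
    if used_pct > WARN_PCT:
        return "NAT-AGT-007"
    return "OK"
```

Calling `buffer_state(disk_used_pct("/path/to/buffer"))` on a schedule (the path is a placeholder) gives an early view of how close the agent is to pausing.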
More context: the glossary (SCP, vTPM, JWT, WMC...) or the complete error code reference.