feat(analyzer): add speech candidate detection for adaptive tuning #4

flexiondotorg · 2026-01-15T11:28:48Z

Summary

Add speech candidate detection to identify representative speech regions for future adaptive filter tuning. This complements the existing silence detection by providing measurements of typical speech characteristics.

Changes

Speech Detection (commit 1)

Add SpeechRegion and SpeechCandidateMetrics data structures
Implement interval-based speech detection after elected silence
Score speech regions by amplitude, centroid, and entropy
Select longest qualifying candidate for speech profiling
Integrate detection into MeasureInput analysis pipeline
Add diagnostic output to processing reports
Add 21 unit tests for speech detection logic

Expanded Metrics (commit 2)

Add all spectral metrics (mean, variance, spread, skewness, crest, flux, slope, decrease, rolloff) to silence and speech candidates
Add loudness metrics (momentary/short-term LUFS, true/sample peak)
Update measurement functions to populate complete metric set
Enhance diagnostic report with organised metric groups

Design

Speech detection runs after silence detection, searching only after the elected silence region ends
Uses 30-second minimum duration with 2-second interruption tolerance for natural pauses
Scoring prioritises amplitude (50%), voice-range centroid (30%), and low entropy (20%)
Selection prefers longest duration above quality threshold (unlike silence which prefers earliest)

Testing

All existing tests pass. New tests cover:

TestSpeechScore — 6 test cases for speech scoring
TestFindSpeechCandidatesFromIntervals — 6 test cases for detection logic
TestMeasureSpeechCandidateFromIntervals — 2 test cases for metrics extraction
TestFindBestSpeechRegion — 3 test cases for candidate selection
TestScoreSpeechCandidate — 4 test cases for candidate scoring

Implements docs/PLAN-SpeechDetection.md

- Add SpeechRegion and SpeechCandidateMetrics data structures - Implement interval-based speech detection after elected silence - Score speech regions by amplitude, centroid, and entropy - Select longest qualifying candidate for speech profiling - Integrate detection into MeasureInput analysis pipeline - Add diagnostic output to processing reports Signed-off-by: Martin Wimpress <[email protected]>

…urements - Add all spectral metrics (mean, variance, spread, skewness, crest, flux, slope, decrease, rolloff) to silence and speech candidates - Add loudness metrics (momentary/short-term LUFS, true/sample peak) - Update measurement functions to populate complete metric set - Enhance diagnostic report with organised metric groups Enables future adaptive filter tuning using full audio characteristics. Signed-off-by: Martin Wimpress <[email protected]>

flexiondotorg · 2026-01-15T11:36:29Z

@cubic-dev-ai Review this pull request

cubic-dev-ai · 2026-01-15T11:36:34Z

@cubic-dev-ai Review this pull request

@flexiondotorg I have started the AI code review. It will take a few minutes to complete.

cubic-dev-ai

1 issue found across 3 files

Prompt for AI agents (all issues)


Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="internal/processor/analyzer.go">

<violation number="1" location="internal/processor/analyzer.go:2777">
P1: If no interval exists at or after `searchStart`, `startIdx` remains 0 and speech detection incorrectly searches from the beginning of the file instead of returning nil. This could detect speech within the silence region for short recordings or when silence is near the end.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

internal/processor/analyzer.go

Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>

flexiondotorg added 2 commits January 15, 2026 11:12

cubic-dev-ai bot reviewed Jan 15, 2026

View reviewed changes

internal/processor/analyzer.go Outdated Show resolved Hide resolved

fix: ensure interval start is found after start index

1f40b8c

Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>

flexiondotorg merged commit 536273c into main Jan 15, 2026
5 checks passed

flexiondotorg deleted the speech-detection branch January 15, 2026 12:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(analyzer): add speech candidate detection for adaptive tuning #4

feat(analyzer): add speech candidate detection for adaptive tuning #4

Uh oh!

flexiondotorg commented Jan 15, 2026

Uh oh!

flexiondotorg commented Jan 15, 2026

Uh oh!

cubic-dev-ai bot commented Jan 15, 2026

Uh oh!

cubic-dev-ai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat(analyzer): add speech candidate detection for adaptive tuning #4

feat(analyzer): add speech candidate detection for adaptive tuning #4

Uh oh!

Conversation

flexiondotorg commented Jan 15, 2026

Summary

Changes

Speech Detection (commit 1)

Expanded Metrics (commit 2)

Design

Testing

Uh oh!

flexiondotorg commented Jan 15, 2026

Uh oh!

cubic-dev-ai bot commented Jan 15, 2026

Uh oh!

cubic-dev-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants