iOS dictation re-fires input events with text split into words

OS: iOS 17.0+ 기기: iPhone or iPad Any 브라우저: Safari 17.0+ 키보드: Voice Dictation 초안

dictationvoice-inputbeforeinputinputiossafariduplicate-eventssync-issue

Phenomenon

On iOS Safari, when using voice dictation to input text into contenteditable elements, the system fires initial beforeinput and input events with the complete dictated text. After the initial input completes, the system re-fires beforeinput and input events with the text split into individual words, causing event handlers to execute multiple times for the same input.

Reproduction example

Open a web page with a contenteditable element on iOS Safari (iPhone or iPad).
Focus the contenteditable element.
Activate voice dictation (long press spacebar or tap microphone icon on keyboard).
Dictate text: “만나서 반갑습니다” (or any multi-word phrase).
Observe the beforeinput and input events in the browser console or event log.

Observed behavior

Initial Dictation Sequence

User activates dictation and speaks “만나서 반갑습니다”
beforeinput event fires with:
- inputType: 'insertText'
- data: '만나서 반갑습니다'
- isComposing: false
input event fires with the complete text “만나서 반갑습니다” inserted into DOM

Duplicate Events Sequence (Bug)

After a short delay (typically 100-500ms), beforeinput event fires again with:
- inputType: 'insertText'
- data: '만나서'
- isComposing: false
input event fires with “만나서” inserted
beforeinput event fires again with:
- inputType: 'insertText'
- data: ' ' (space character)
- isComposing: false
input event fires with space inserted
beforeinput event fires again with:
- inputType: 'insertText'
- data: '반갑습니다'
- isComposing: false
input event fires with “반갑습니다” inserted

Key Characteristics

Composition events (compositionstart, compositionupdate, compositionend) do NOT fire during dictation
isComposing is always false in all events
Events are re-fired after the initial input completes
Text is split at word boundaries (spaces)
The DOM state after duplicate events is the same as after initial events (no actual change)
Event sequence becomes out of sync with DOM state

Expected behavior

Initial beforeinput and input events should fire once with the complete dictated text
Events should NOT be re-fired after completion
If events are re-fired, they should reflect actual DOM changes (not duplicate insertions)
Event sequence should remain synchronized with DOM state
Composition events should fire during dictation (as they do on macOS Safari)

Impact

This can lead to:

Duplicate processing: Event handlers execute multiple times for the same input
State synchronization issues: Application state may become inconsistent with DOM state
Performance issues: Unnecessary processing of duplicate events
Undo/redo corruption: Undo stack may contain duplicate or incorrect entries
Validation issues: Validation logic may run multiple times on the same input
Formatting issues: Formatting logic may be applied incorrectly due to split text
Event sequence confusion: Handlers expecting a single input event receive multiple events

Browser Comparison

iOS Safari: Dictation does not fire composition events, events are re-fired after completion with text split into words
iOS Chrome: Same behavior as Safari (uses WebKit engine, required by Apple)
macOS Safari: Dictation fires composition events, events are not re-fired after completion
Chrome/Edge/Firefox (Desktop): Dictation behavior varies but generally more consistent, no duplicate re-firing

Distinguishing Dictation Input

Important: There is no reliable way to detect dictation input in web applications on iOS. Web APIs do not provide dictation detection capabilities, and native iOS APIs like UITextInputContext.isDictationInputExpected are not available in web contexts.

Potential Indicators (Not Reliable)

Absence of composition events (but this also occurs with Korean IME on iOS)
Rapid insertion of multiple words
Text appears to be split and re-inserted
Events fire in quick succession with complete words
isComposing is always false (but this is also true for Korean IME on iOS)

These indicators are not definitive and may produce false positives.

Event Sequence

The sequence of events when inputting “만나서 반갑습니다” via dictation:

Phase 1: Initial Dictation Input

Order	Event	inputType	data	DOM State (before)	DOM State (after)
1	beforeinput	insertText	’만나서 반갑습니다’	""	-
2	input	insertText	’만나서 반갑습니다’	""	"만나서 반갑습니다” ✅

Phase 2: Duplicate Events (Bug)

After initial input completes, after a delay of approximately 100-500ms, events are re-fired with text split into words:

Order	Event	inputType	data	DOM State (before)	DOM State (after)
3	beforeinput	insertText	’만나서‘	“만나서 반갑습니다”	-
4	input	insertText	’만나서‘	“만나서 반갑습니다"	"만나서 반갑습니다” ❌
5	beforeinput	insertText	’ ‘	“만나서 반갑습니다”	-
6	input	insertText	’ ‘	“만나서 반갑습니다"	"만나서 반갑습니다” ❌
7	beforeinput	insertText	’반갑습니다‘	“만나서 반갑습니다”	-
8	input	insertText	’반갑습니다‘	“만나서 반갑습니다"	"만나서 반갑습니다” ❌

Key Characteristics

Events 1-2: Complete text inserted at once (DOM actually changes)
Events 3-8: Text re-fired word-by-word but DOM doesn’t change
Composition events: compositionstart, compositionupdate, compositionend events do NOT fire in any phase
isComposing: All events have isComposing: false
Delay between phases: 100-500ms delay between Event 2 and Event 3

Complete Event Monitoring

Code to monitor all events during iOS dictation input:

const element = document.querySelector('[contenteditable]');
const eventLog = [];

const eventsToMonitor = [
  'compositionstart', 'compositionupdate', 'compositionend',
  'beforeinput', 'input',
  'keydown', 'keyup', 'keypress'
];

eventsToMonitor.forEach(eventType => {
  element.addEventListener(eventType, (e) => {
    const eventData = {
      timestamp: Date.now(),
      type: eventType,
      inputType: e.inputType || null,
      data: e.data || null,
      isComposing: e.isComposing || false,
      textContent: element.textContent
    };
    eventLog.push(eventData);
    console.log(`[${eventType}]`, eventData);
  }, { capture: true });
});

Events That Fire vs Do Not Fire

Event Type	Fires?	Initial Input	Duplicate Events
`beforeinput`	✅ Yes	1 time	3 times
`input`	✅ Yes	1 time	3 times
`compositionstart`	❌ No	-	-
`compositionupdate`	❌ No	-	-
`compositionend`	❌ No	-	-
`keydown`	❌ No	-	-
`keyup`	❌ No	-	-
`keypress`	❌ No	-	-

Notes and possible direction for workarounds

Event Handling Considerations

Event handlers may execute multiple times for the same input
Events without actual DOM changes (Events 4, 6, 8) should not be processed
Check textContent to determine if DOM actually changed

Undo/Redo Stack

Recording duplicate events in undo stack creates duplicate undo entries
Only record in undo stack when there’s an actual DOM change

Additional Considerations

Selection State

Selection state may be reset when duplicate events fire, so don’t use selection for duplicate detection; trust only textContent

Undo/Redo Stack

Recording duplicate events in the undo stack creates duplicate undo entries
Using textContent-based deduplication ensures only actual changes are recorded in the undo stack

Voice Control Simultaneous Use

Enabling both Voice Control and Dictation in iOS settings may cause text to be inserted twice
This case actually changes the DOM, so textContent-based deduplication won’t detect it
Recommend users enable only one

Test Environment

iOS Version	Browser	Language	Reproduced
iOS 16.x	Safari	Korean	✅ Confirmed
iOS 16.x	Safari	English	✅ Confirmed
iOS 16.x	Chrome iOS	Korean	✅ Confirmed
iOS 17.x	Safari	Korean	✅ Confirmed
iOS 17.x	Safari	English	✅ Confirmed
iOS 17.x	Chrome iOS	Korean	✅ Confirmed
iOS 18.x	Safari	Korean	⚠️ Unconfirmed
iOS 18.x	Safari	English	⚠️ Unconfirmed

Note: The same issue likely occurs across all iOS versions (shared WebKit engine). Issue appears to occur regardless of language.

이 시나리오의 변형

케이스	OS	브라우저	상태
ce-0293-ios-dictation-duplicate-events-safari-ko	iOS 17.0+	Safari 17.0+	초안
ce-0294-ios-dictation-duplicate-events-safari-ko	iOS 17.0+	Safari 17.0+	초안

Playground for this case

Use the reported environment as a reference and record what happens in your environment while interacting with the editable area.

Reported environment

OS: iOS 17.0+

Device: iPhone or iPad Any

Browser: Safari 17.0+

Keyboard: Voice Dictation

Your environment

OSDeviceBrowserKeyboard

Sample HTML:

Event log

Use this log together with the case description when filing or updating an issue.

0 events

Interact with the editable area to see events here.