When you talk to a stranger online, the medium shapes the conversation as much as the words do. Text, voice, and video each create a genuinely different experience — and knowing when to use each makes a real difference. Here is a practical breakdown of what each mode does well, and when to switch.
Text Chat: The Default Starting Point
Text is low-commitment. You can think before you respond. There's no pressure to fill silence. You can be funnier, more precise, or more honest than you might be in real-time speech. Anonymity feels most complete in text — there's no voice tone or facial expression to read, which is either freeing or distancing depending on the conversation.
Text is best for: starting conversations with strangers (lower barrier), philosophical or abstract discussions where precision matters, situations where you want to think carefully before responding, and conversations where the other person's identity is irrelevant.
Text is not ideal for: conversations with heavy emotional weight (tone is easily misread), moments that need warmth or comfort, or when the subject is genuinely complex and nuanced.
Voice Chat: The Middle Ground
Voice adds tone, pace, hesitation, laughter — all the things that make someone feel like a real person rather than a text box. It's significantly warmer than text without the vulnerability of being seen. Most people find voice easier to sustain than text for a long conversation because it flows more naturally.
Voice is best for: conversations that have already found their depth, situations where you want warmth without being on camera, longer discussions where typing becomes tedious, and moments of genuine emotional exchange.
Voice is not ideal for: situations where you're in a noisy environment, early in a conversation when you're not sure where it's going, or when you want time to compose your thoughts carefully.
Video Chat: The Fullest Connection
Video is the closest thing online to actually meeting someone. You see expressions, reactions, the physical space they inhabit. Conversations go to more interesting places faster because you're both more present. The accountability of being seen also tends to make people more thoughtful — harder to be rude when you're face-to-face.
Video is best for: conversations that have already established some warmth and trust, when you want to feel genuinely seen and not just heard, and when the subject benefits from full human presence.
Video is not ideal for: early stranger conversations where you haven't built any rapport, situations where your physical environment would reveal personal information, or when you just want to talk without the overhead of being on camera.
The Natural Progression
On OurStranger, most good conversations follow a natural arc: text to get a feel for the person, voice when there's genuine interest, video when there's real connection. You can switch between all three mid-conversation with a single tap — no ending the call, no re-connecting. This is deliberate. The best conversations are not static.
The progression is not a rule — some conversations stay in text and are better for it. Some go straight to voice. The point is that you have all three available, and you can move between them based on where the conversation naturally wants to go.
What the Research Says
Studies on online communication consistently show that richer media (video > voice > text) increases perceived social presence and typically leads to more disclosure and more trust — but at the cost of lower anonymity and higher vulnerability. The "media richness" tradeoff is real. This is why having all three modes available matters: different conversations call for different levels of presence.