TalkLot — PJ Rodriguez

The Problem

Most language-learning apps optimize for vocabulary acquisition: flashcards, listening comprehension, multiple-choice drills. Speaking is treated as a downstream reward, something you unlock after enough passive study. But fluency in conversation does not work that way. Oral production requires repetition under pressure, immediate feedback, and volume of practice that existing tools do not support.

The research question this project pursued: how might a language learner build genuine spoken fluency through structured, high-volume oral practice integrated directly into their study routine?

Research

Competitive Analysis

Eight language-learning platforms were evaluated across four dimensions: speaking practice tools, feedback mechanisms, gamification structure, and community features. The pattern was consistent. Apps built around gamification and vocabulary retention dominated the market. Dedicated speaking practice was rare and, where it existed, was limited to pronunciation checking against a recorded model rather than sustained oral production. Duolingo, shown in detail, exemplifies the dominant pattern: a gamified, vocabulary-first product with no dedicated speaking practice mode.

Netnography

Key insight callout: learners report freezing in real conversation despite years of passive study

A netnographic study of language-learning forums and communities surfaced the frustration directly. Learners who had studied a language for years described freezing in real conversations, losing words they had drilled extensively, and feeling that their study time had not prepared them to actually speak. The gap between passive study and active production was the dominant theme.

Secondary Research

Linguistic research supported what the netnography surfaced. Extensive oral production — defined as high-volume low-stakes speaking practice — is among the most effective methods for building spoken fluency. Output-based learning forces the brain to retrieve and activate vocabulary differently than input-based methods do. Learners who speak more, even imperfectly, close the fluency gap faster.

Synthesis

Storyboard mapping a learner's TalkLot session from launch through practice and review

Three research methods pointing to the same finding made the design direction clear. A language learner needed a tool built specifically around oral output: not a supplement to existing apps, but a dedicated space for speaking practice. I created a storyboard in Adobe Illustrator to map a learner's session from launch through practice and review, which served as the interaction framework for prototyping.

Prototyping

Microinteractions

Dark theme microinteraction prototype mockup showing button states and transitions

Dark theme microinteraction detail callout

Light theme microinteraction detail callout

Light theme microinteraction demo

Early prototyping focused on microinteraction states: how buttons respond, how transitions signal mode changes, how feedback appears without interrupting the practice session. Two color themes were developed in parallel, dark and light, to evaluate legibility and focus across ambient conditions. The light theme performed better in usability checks and became the primary direction. Screen recordings of prototype interactions were captured with Windows Movie Maker, the tool available at the time.

Teleprompter

Teleprompter scroll driven by voice input

As research expanded to cover oral production outside of language-learning contexts, tools used in live presentations suggested a teleprompting format. In TalkLot's teleprompter feature, the user's voice triggers a downward scroll of the passage text in real time, creating a touchless, speech-driven reading experience. Early prototyping showed that highlighting words at completion aided pacing, but sub-word highlighting at the syllable level may better support pronunciation and liaison. User testing would determine the optimal unit of highlight.

Translation and Vocabulary

Translation overlay feature showing source and target language text in context

Vocabulary hover interaction with background blur

Two features addressed the gap between reading and comprehension during practice. The translation overlay allowed learners to access meaning in context without leaving the session. The vocabulary hover interaction surfaced word definitions and alternatives when a learner paused on a term, with a background blur that kept the word in focus without removing it from sentence context.

Pronunciation Trainer

Pronunciation trainer with session skill curve

The pronunciation trainer recorded and displayed a learner's spoken output against a target model, generating a visual skill curve across sessions. Rather than a binary pass/fail judgment, the tracker showed trajectory, rewarding consistent practice regardless of whether individual attempts matched the target exactly.

Testing

High-fidelity prototype testing was conducted with five participants over two sessions. Testers included two intermediate Spanish learners, one advanced French learner, one beginner Mandarin learner, and one language educator reviewing the tool as an instructional aid.

Key findings: the teleprompter scroll rate needed user control rather than a fixed voice-detection threshold; the vocabulary hover interaction was intuitive without instruction; the pronunciation tracker was described as motivating by four of five participants; and the translation overlay required a clearer visual distinction between source and target language text.

Each finding produced a specific revision. Scroll rate became an adjustable setting. The translation overlay received a typographic treatment separating the two languages by weight and color. The pronunciation tracker's display range was extended to show the full session arc rather than only the most recent attempts.

Interact with the prototype →

Reflection

TalkLot was a solo project completed in Senior Studio at Santa Monica College, and the only project in the portfolio where every research, synthesis, and design decision was mine alone. That constraint clarified something: a thesis formed earlier and held longer when there was no one to defer to.

The research converged faster than expected. Three methodologies aligned on the same problem, which made synthesis straightforward. The harder design work was in the prototyping phase, specifically in figuring out how to make speaking feel supported rather than evaluated. The pronunciation tracker took several iterations to land on a display that felt like a record rather than a grade.

The features that tested best — vocabulary hover and pronunciation tracking — were also the furthest from what existing apps offered. That was the confirmation the research had been pointing toward: the gap was real and the direction was right.