We examined gestural coordination in word-initial C1C2 clusters (C1 a stop, C2 a lateral or tap) using articulatory (electromagnetic articulometry) and acoustic data from six speakers of Standard Peninsular Spanish. We report on patterns of voice onset time (VOT), the gestural plateau durations of C1 and C2, and the overlap between them. For VOT, as expected, place of articulation is a major factor, with velars exhibiting longer VOTs than labials. For C1 plateau duration, we found effects of voicing and place: voiced consonants are significantly shorter than voiceless consonants, and velars are longer than labials. For C2 plateau duration, lateral duration varies as a function of onset complexity (C vs. CC). As for overlap, unlike in French, where articulatory data for clusters have also been examined, clusters in which both C1 and C2 are voiced show more overlap than clusters in which voicing differs. Further, overlap was affected by the identity of C2: clusters where C2 is a tap show less overlap than clusters where C2 is a lateral. We discuss these results in the context of work aiming to uncover phonetic (e.g., articulatory or perceptual) and phonological (e.g., syllabic organization) forces on timing.