&lt;p&gt;Article&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;b&gt;Introduction&amp;nbsp; &lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;text-indent:36pt; margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;This article offers a survey of how variation is treated in the language sciences, seen from the perspective of natural language processing (NLP).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Defining what the norm is already raises many problems in linguistics (Siouffi &amp;amp; Steuckardt, 2007). In NLP, the challenge of drawing a precise outline of the norm and, subsequently, of what counts as variation around that norm takes different forms, which play out at other levels of analysis.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;This article does not set out to retrace the history of definitions of the concept of &amp;ldquo;norm&amp;rdquo; in linguistics. It is nonetheless worth noting that debates about the norm (and about its variations) often revolve around the following epistemological core:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&amp;ldquo;Should the language be described on the basis of observable linguistic facts, that is, the many and varied performances we are exposed to in everyday life, or should it be conceived on the basis of idealized competences?&amp;rdquo; (Barge, 2009)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Whether or not one wishes to account for dialectal, diachronic or sociolinguistic diversity, and whether one argues for a prescriptive and evaluative use of the language or accepts any kind of linguistic variation (provided that it still guarantees that meaning is conveyed and mutually understood without failure), the richness of French already presents a non-negligible number of &amp;ldquo;normed variations&amp;rdquo;. By this expression the author of this article means any linguistic phenomenon that, in speech as in writing, does not follow the rule, that is, what is usually expected for the same element in the same context.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;In spoken French these &amp;ldquo;normed variations&amp;rdquo; include, among others, hiatus, the various forms of liaison, and irregular verbs. In writing they multiply: since French orthography is opaque, the number of words that are both homographs and homophones, of homophones that are not homographs, or conversely of homographs that are not homophones, is only the tip of the iceberg of a multitude of &amp;ldquo;normed variations&amp;rdquo;.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;What, then, is the norm, and what is variation? Is variation exclusively a nonconforming usage that differs according to dialect, period, social class or ethnic group? Or can variation be understood as any deviation from a set of logical criteria on which a natural language ought to be based?&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Indeed, if one adopts the following definition of the norm, &amp;ldquo;Everything that is in common and current use in a linguistic community; the norm then corresponds to the social institution that the language constitutes&amp;rdquo; (Dubois et al., 1973, p. 342), one could answer the first question raised in the paragraph above.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The second question is much harder to answer, and its very formulation opens up several levels of analysis. To begin with, French orthography is anything but logical (Hoedt &amp;amp; Piron, 2016). Take a new word that does not exist but obeys the phonotactic rules of French, such as / kʁefisjɔ̃ / (Hoedt &amp;amp; Piron, 2016): how could it be transcribed in a way that respects the norms of French orthography?&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&amp;ldquo;Krefision&amp;rdquo; or &amp;ldquo;krefisiont&amp;rdquo;? Certainly, but &amp;ldquo;crephission&amp;rdquo;, &amp;ldquo;crefition&amp;rdquo; or &amp;ldquo;chraisfiscion&amp;rdquo; should also be considered conforming candidates.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;All of these forms are possible under French orthography; none could be judged out of the norm or atypical. &lt;a name=&quot;_Hlk120872744&quot;&gt;An algorithm written for this purpose, using a combinatorial computation that takes into account all the letters and/or syllables that are homophones without being homographs, returned the total number of possible transcriptions of the invented word &lt;/a&gt;/ kʁefisjɔ̃ /: there are 240 (Hoedt &amp;amp; Piron, 2016). Clearly, it is difficult to speak of norm and variation when the orthographic norm derives, at least in a good number of cases, from nothing more than a largely arbitrary association between a phoneme and its corresponding grapheme(s).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
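&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The kind of combinatorial count mentioned here can be sketched as follows. This is only an illustration: the grapheme inventory and the names GRAPHEMES, count_spellings and enumerate_spellings are hypothetical, and the toy inventory is not the one used by Hoedt &amp;amp; Piron, so the total it yields differs from their 240.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
from itertools import product
from math import prod

# Hypothetical, deliberately incomplete inventory of candidate graphemes
# for each segment of /kʁefisjɔ̃/. It is NOT the table used by Hoedt & Piron.
GRAPHEMES = {
    "k": ["k", "c", "ch", "qu"],
    "ʁ": ["r", "rh"],
    "e": ["é", "ai"],
    "f": ["f", "ph"],
    "i": ["i", "y"],
    "sjɔ̃": ["ssion", "tion", "cion", "scion"],
}


def count_spellings(segments, inventory):
    """Number of spellings = product of the number of options per segment."""
    return prod(len(inventory[seg]) for seg in segments)


def enumerate_spellings(segments, inventory):
    """Yield every spelling obtained by picking one grapheme per segment."""
    for combo in product(*(inventory[seg] for seg in segments)):
        yield "".join(combo)


segments = ["k", "ʁ", "e", "f", "i", "sjɔ̃"]
total = count_spellings(segments, GRAPHEMES)
spellings = list(enumerate_spellings(segments, GRAPHEMES))
print(total, spellings[:5])
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The total grows multiplicatively with every segment that admits more than one grapheme, which is why even a short invented word reaches hundreds of admissible spellings.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;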

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The authors of that work wonder why &amp;ldquo;critical thinking stops at the threshold of orthography&amp;rdquo; (Hoedt &amp;amp; Piron, 2016). The lack of a one-to-one relation between grapheme and phoneme gives French orthography a particular character, which it shares with other languages (for example English or German). Languages with a fully transparent orthography are relatively few; Spanish and Turkish are examples (note that the Latin alphabet was introduced in Turkey in the 20th century through a top-down adaptation: usage took shape only after the norm had already been established by the new state).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;After this brief digression, it should be noted that for the computer all variations are alike, since they constantly raise the same problem: ambiguity (Kraif &amp;amp; Ponton, 2007; Jusoh, 2018).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Anything that falls outside a determinable and predictable logic becomes difficult for a computer: computing the type/token ratio of the whole of Diderot and d&amp;rsquo;Alembert&amp;rsquo;s Encyclop&amp;eacute;die is a simple task, whereas treating the two expressions &amp;ldquo;je ne peux pas&amp;rdquo; and &amp;ldquo;je peux pas&amp;rdquo; (both meaning &amp;ldquo;I cannot&amp;rdquo;) as equivalent becomes complicated. The reason, as is well known, lies in the deducibility of the rules to be applied and of the exceptions to be granted to those rules: if a program has been taught to recognize negation through the structure (subject + ne + verb + pas), it will be hard to make it detect the same entity in a context where one element is missing. It will be harder still to make it recognize that in some social contexts the first form is mandatory, whereas in others both forms are acceptable.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
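&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The contrast can be illustrated with a minimal sketch. The function and pattern names (type_token_ratio, STRICT_NEG, RELAXED_NEG, has_negation) are hypothetical, the subject list is deliberately short, and elided forms such as &amp;ldquo;n&amp;rsquo;&amp;rdquo; are ignored; the point is only that the type/token ratio needs no linguistic rule, whereas a rigid negation rule misses the variant without &amp;ldquo;ne&amp;rdquo;.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
import re


def type_token_ratio(text):
    """Distinct word forms (types) divided by running words (tokens)."""
    tokens = re.findall(r"\w+", text.lower())
    return len(set(tokens)) / len(tokens) if tokens else 0.0


# Rigid rule: subject + ne + verb + pas.
STRICT_NEG = re.compile(r"\b(je|tu|il|elle|on)\s+ne\s+(\w+)\s+pas\b")
# Relaxed rule: "ne" becomes optional, so both variants match.
RELAXED_NEG = re.compile(r"\b(je|tu|il|elle|on)\s+(?:ne\s+)?(\w+)\s+pas\b")


def has_negation(utterance, pattern):
    """Return True if the pattern finds a negated clause in the utterance."""
    return pattern.search(utterance.lower()) is not None


for utterance in ("je ne peux pas", "je peux pas"):
    print(utterance,
          has_negation(utterance, STRICT_NEG),
          has_negation(utterance, RELAXED_NEG))
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Relaxing the rule restores the match for the second variant, but neither pattern knows anything about the social context in which each form is acceptable.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;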

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;These problems of multiple transcriptions, alignment and context-dependent disambiguation arise in every branch of linguistics that uses NLP to automate repetitive tasks, to test hypotheses or to provide representations of large databases.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Two case studies are presented in the two parts of this article. The first concerns a computation of word frequencies and shows how children&amp;rsquo;s lexical variation was modeled to make a task easier to automate. The second presents several tools and workarounds used in an attempt to standardize the treatment of children&amp;rsquo;s phonetic/phonological variations, with the ultimate aim of tracing their path of phoneme acquisition. These examples show how NLP has become an indispensable tool in linguistics thanks to its computing power and speed of execution. Its use can nevertheless be treacherous, since the intrinsically ambiguous and polysemous nature of language entails a non-negligible number of biases and exceptions to the rules.
As will be detailed in both parts, NLP leads us to important decisions, often in the form of a trade-off that must be calibrated carefully. For example: is it better to favour efficiency at the expense of precision? Is it better to accept a bias in the initial coding in order to avoid problems when processing categories later on? Or, conversely, is it better to record every variation during coding and end up with categories whose boundaries are fuzzy?&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;b&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:12.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;First case study: estimating the evolution of the Zipf distribution in children.&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;text-indent:36pt; margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The CoLaJE corpus (Morgenstern &amp;amp; Parisse, 2012) is the basis of this study on the acquisition of French as a first language (L1). It consists of seven longitudinal follow-ups of children who were recorded for one hour every month, from about one year of age to about five. The corpus meets the standards of statistical representativeness required in this field (Tomasello &amp;amp; Stahl, 2004; Yamaguchi, 2018).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;For each child there are about 8,000 utterances and 20,000 words, with a mean length of utterance (MLU; MacWhinney, 2000) of three words. Child-directed speech was also recorded and is transcribed on the FAT (father) and MOT (mother) lines. Each transcription is proofread by a peer, so that the interpretations of the children&amp;rsquo;s ambiguous expressions are agreed upon by several researchers.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom: 11px; text-align: center;&quot;&gt;&lt;img height=&quot;266&quot; src=&quot;https://www.numerev.com/img/ck_2808_28_image-20221206185740-1.png&quot; width=&quot;545&quot; /&gt;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Figure 1. Excerpt from CoLaJE. ADRIEN-33-4_02_15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The study examines how the word frequency distribution develops in the children of the CoLaJE corpus, in order to assess how their lexical production relates to a standard distribution of word frequencies: Zipf&amp;rsquo;s law, which is found in all known languages (Zipf, 1949; Piantadosi, 2014). More specifically, it follows earlier work on the evolution of this word frequency distribution that had already been carried out on several languages (Baixeries et al., 2013) and applies it to French for the first time (Briglia et al., 2022).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The Zipf distribution is regarded as a standard of efficiency in human communication (Lestrade, 2017): a language must be able to convey meaning precisely while keeping that task from becoming too costly for speakers. The principle of least effort (Zipf, 1949) works to ensure that the proportion of &lt;i&gt;types&lt;/i&gt; to &lt;i&gt;tokens&lt;/i&gt; in a given corpus is sufficient to achieve the communicative goal. If, for example, the author of an article can be sure of being understood with a range of 70 different words, there is no reason to use more, since the communicative value of the words in excess of Zipf&amp;rsquo;s constant is not worth the cognitive cost of processing them. According to some authors (Lestrade, 2017), the constant of Zipf&amp;rsquo;s law is an implicit compromise between speakers that operates at the semantic and syntactic levels. The law applies to speech as well as to written text, with negligible differences between the two (Piantadosi, 2014).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The point of examining how this constant develops during first-language acquisition is therefore to understand how the child&amp;rsquo;s evolving language approaches an adult norm of communicative efficiency. To test this hypothesis, a methodological choice that is common in NLP had to be made. The language production of the CoLaJE children under study is represented on three lines (see the example in Figure 1): &lt;i&gt;pho &lt;/i&gt;gives what the child says in the IPA (International Phonetic Alphabet), &lt;i&gt;mod &lt;/i&gt;gives what the child should have pronounced according to the adult norm, in IPA, and CHI gives what the child should have pronounced according to the adult norm, in standard orthography. Before computing the word frequency distribution in a recording, one must first understand what a word is for a child (Vihman &amp;amp; McCune, 1994). 
For example, for the target word &amp;ldquo;comprendre&amp;rdquo; (to understand), Adrien&lt;a href=&quot;#_ftn1&quot; name=&quot;_ftnref1&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:11.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; at the age of 4 years and 3 months (4_03_26) produces the following variants:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/ pʁopʁɑ̃d / and / kɔ̃pʁɑ̃d /.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Given that the context is &amp;lt;je vais je vais comprendre les lettres moi&amp;nbsp;!&amp;gt; (roughly, &amp;ldquo;I&amp;rsquo;m going to, I&amp;rsquo;m going to understand the letters, me!&amp;rdquo;) and that the father wanted him to do letter-reading exercises, it is clear that the two variant forms above refer to the same entity (&lt;i&gt;i.e.&lt;/i&gt; the verb &amp;lsquo;comprendre&amp;rsquo;). There are many analogous cases (for example the words &amp;lsquo;tracteur&amp;rsquo; or &amp;lsquo;pourquoi&amp;rsquo;), and they force a choice: if every phonetic/phonological variation produced by the child is counted separately, the development of Zipf&amp;rsquo;s constant can never be studied in this corpus, because treating each variation as a distinct form yields a very high number of distinct forms even though the referent is always the same. In other words, there will be several different &lt;i&gt;types&lt;/i&gt; where, from a certain perspective, there are only several different tokens of the same &lt;i&gt;type.&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
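&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The effect of this choice on the type count can be sketched as follows. The data structure and the name count_types are hypothetical and do not reflect the actual CoLaJE file format; only the two forms of &amp;lsquo;comprendre&amp;rsquo; come from the corpus excerpt quoted above, and the other pairs are invented for illustration.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
from collections import Counter

# Each token is a pair: (form on the pho line, target on the CHI line).
# Only the two 'comprendre' forms come from the text; the rest are invented.
utterance_tokens = [
    ("pʁopʁɑ̃d", "comprendre"),
    ("kɔ̃pʁɑ̃d", "comprendre"),
    ("le", "les"),        # invented example
    ("lɛt", "lettres"),   # invented example
]


def count_types(tokens, tier):
    """Count occurrences per type, keyed on the chosen transcription tier."""
    index = {"pho": 0, "CHI": 1}[tier]
    return Counter(token[index] for token in tokens)


pho_types = count_types(utterance_tokens, "pho")
chi_types = count_types(utterance_tokens, "CHI")
print(len(pho_types), "types on pho;", len(chi_types), "types on CHI")
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Keyed on the phonetic line, every variant pronunciation inflates the number of types; keyed on the orthographic target, the variants collapse into a single type with a higher frequency.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;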

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Moreover, a given word can be pronounced in several different ways with different degrees of variation, which complicates the computations. It is also difficult to establish with certainty whether a child gives a word the same meaning an adult does: differences due to children&amp;rsquo;s under-extension or over-extension errors (Thomson &amp;amp; Chapman, 1977) may be at work without our being aware of them. It is difficult to establish when a word means what it was supposed to mean for a child, and to what extent different variant forms refer to the same entity, especially at the earliest ages (Vihman, 1994).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;It was therefore decided, in order to homogenize the corpus and make &lt;i&gt;inter&lt;/i&gt;-child comparisons possible, to base the NLP modeling on the referent, disregarding the different acoustic images that pointed to it. This choice meant accepting potential biases tied to the decisions of the transcribers who first interpreted the child&amp;rsquo;s speech; these biases are difficult to estimate given the size of the corpus. In NLP terms, this amounts to grouping a set of variations under a single category tied to the referent. This made it possible to process automatically a large amount of data from the CoLaJE children in order to trace the evolution of the constant of Zipf&amp;rsquo;s law over time (Briglia et al., 2022, pp. 6-7). In short, giving up variation at one level of analysis (the word) made it possible to analyze the role of variation at a higher level (the lexicon), in a temporal perspective that brings out the &lt;i&gt;inter&lt;/i&gt;-child differences.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The constant that is estimated is the exponent parameter of the word frequency distribution (&lt;i&gt;alpha&lt;/i&gt;), for each child as well as for the parents&amp;rsquo; speech. We show that the values of &lt;i&gt;alpha&lt;/i&gt; tend to converge towards 1 over the course of development, which is consistent with the state of the art (Baixeries et al., 2013). The choice between variation and norm explained above also brought the child&amp;rsquo;s language closer to the adult&amp;rsquo;s, laying the groundwork for a comparison between the &lt;i&gt;alpha&lt;/i&gt; exponent of the children&amp;rsquo;s language and that of the adults: Spearman&amp;rsquo;s &lt;i&gt;rho&lt;/i&gt; shows a positive correlation (p-value &amp;lt; 0.05) between the child&amp;rsquo;s &lt;i&gt;alpha&lt;/i&gt; and the parents&amp;rsquo; &lt;i&gt;alpha&lt;/i&gt; at all ages, which increases at later ages (Briglia et al., 2022, p. 184). This clearly indicates that parental input plays an increasingly important role in structuring the child&amp;rsquo;s output (Goodman et al., 2008).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
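&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;One simple way to obtain such an exponent is sketched below. The text does not say which estimator was used in the study, so this is an assumption: the hypothetical function zipf_alpha fits a straight line to log frequency against log rank by ordinary least squares, a basic approach that is known to be biased on small samples and for which maximum-likelihood estimators are often preferred.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
import math
from collections import Counter


def zipf_alpha(tokens):
    """Estimate alpha in f(r) ~ r**(-alpha) by least squares on log-log data.

    Assumption: a plain regression of log(frequency) on log(rank); the
    estimator actually used in the study is not specified in the text.
    """
    freqs = sorted(Counter(tokens).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct types")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # The fitted slope is negative; alpha is its absolute value.
    return -sxy / sxx


# Synthetic sample: the word of rank r occurs about 1000 / r times.
sample = [f"w{r}" for r in range(1, 51) for _ in range(round(1000 / r))]
print(round(zipf_alpha(sample), 2))
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Applied to the tokens of each recording in turn, an estimator of this kind yields one value of &lt;i&gt;alpha&lt;/i&gt; per session, which can then be plotted against the child&amp;rsquo;s age.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;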

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The three graphs below show how the alpha exponent varies over time. The value alpha = 1 could be regarded as the norm, since it has been shown that this value of the exponent gives the optimal number describing how many different words an excerpt (written or spoken) of a given size contains on average, as the outcome of an implicit compromise reached by speakers (Zipf, 1949; Piantadosi, 2014). Comparing the three graphs, one can see that the three curves are not isomorphic, and yet they seem to hover just below or above the value 1 over time (that is, over the course of development). This would be consistent with an implicit tendency of human language to reach the equilibrium described by the formula that Zipf proposed in the last century (Zipf, 1949).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom: 11px; text-align: center;&quot;&gt;&lt;img height=&quot;270&quot; src=&quot;https://www.numerev.com/img/ck_2808_28_image-20221206185958-3.png&quot; width=&quot;460&quot; /&gt;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Figure 2. Evolution of the exponent &lt;i&gt;alpha&lt;/i&gt; for Adrien&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&lt;img height=&quot;267&quot; src=&quot;https://www.numerev.com/img/ck_2808_28_image-20221206190058-4.png&quot; width=&quot;453&quot; /&gt;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Figure 3. Evolution of the exponent &lt;i&gt;alpha&lt;/i&gt; for Madeleine&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p align=&quot;center&quot; style=&quot;text-align:center; margin-bottom:11px&quot;&gt;&lt;img height=&quot;264&quot; src=&quot;https://www.numerev.com/img/ck_2808_28_image-20221206190140-5.png&quot; width=&quot;451&quot; /&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom: 11px; text-align: center;&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Figure 4. Evolution of the exponent &lt;i&gt;alpha&lt;/i&gt; for Julie&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom: 11px; text-align: center;&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;b&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:12.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;The status of the norm and of phonetic/phonological variation.&lt;/span&gt;&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;text-indent:36pt; margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Variation lies at the heart of child language acquisition (Hickmann et al., 2018): it influences every stage of the process, in perception as well as in production, and at every level of analysis (from phonetics to pragmatics). One could say that the only common denominator of first-language acquisition is variation, since it is present at both the &lt;i&gt;inter&lt;/i&gt;-individual and the &lt;i&gt;intra&lt;/i&gt;-individual level. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;As Bates points out: &amp;ldquo;it is nevertheless necessary to qualify this apparent uniformity by stressing the very great intra- and inter-individual variability that characterises this acquisition&amp;rdquo; (Bates et al., 1995). &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The importance of variation among the children of the CoLaJE corpus is well illustrated by the graphs showing the evolution of several linguistic indices proposed by the researchers who built the corpus (Morgenstern &amp;amp; Parisse, 2012).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Before mastering their mother tongue and being able to speak like an adult, that is, before being able at the perceptual and articulatory level to pronounce the target form (&lt;i&gt;i.e.&lt;/i&gt; the social norm) of a word, children go through several stages. The first is the recognition of the suprasegmental level, which plays &amp;ldquo;an important role in setting up the first grammatical constructions, notably when the first words and the first word combinations appear, in the period following the single-word period (holophrastic stage)&amp;rdquo; (Martel &amp;amp; Dodane, 2012, p. 13). Prosody was not considered in this study for reasons of feasibility, the focus being on the lexicon on the one hand and on phonetics on the other. Nevertheless, children rely on prosody to detect the pauses, intonations and stresses that help them locate word boundaries as well as syntactic dependency relations. 
Indeed, &amp;ldquo;it does seem that prosodic features are used by the child to lay the foundations of future grammatical constructions, but that these manifest themselves differently at the time of the first words (temporal template of proto-words and first words) and of the first word combinations (unitary contours that ensure the cohesion of the different units within a larger unit)&amp;rdquo; (Martel &amp;amp; Dodane, 2012, pp. 32-33).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The aim of the example proposed here is to model how phonetic/phonological variations are structured over time, and to estimate the degree of &lt;i&gt;intra&lt;/i&gt;-child and &lt;i&gt;inter&lt;/i&gt;-child variability. Previous studies (Dos Santos, 2007; Yamaguchi, 2012; Morgenstern &amp;amp; Parisse, 2012) have shown that there is no &amp;ldquo;typical&amp;rdquo; path in acquisition, but rather phonetic and phonological constraints that define the possible contours of the route towards the adult norm. Each variation appears to be influenced by the preceding variation and, in turn, to influence the following one (Sauvage, 2015). In other words, variations would not be due to chance; they would be constrained by several factors such as place of articulation, manner of articulation, and the frequency of occurrence of a target in parental input (Ambridge et al., 2015).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;There are essentially two theories that could be adopted to account for acquisition paths: &lt;i&gt;optimality theory&lt;/i&gt; (Prince &amp;amp; Smolensky, 2004) and phonological feature theory (Clements, 1985). These theories belong, respectively, to the nativist and the constructivist traditions. In this study, Clements&amp;rsquo; theory was adopted for several reasons: the author of this article is convinced that it has deeper and more exhaustive explanatory power than the competing theory; moreover, most of the references cited in this article take constructivism (or &lt;i&gt;usage-based theory&lt;/i&gt;) as the starting point of their analyses. However, the focus here is not on whether this theory can account for every possible variation in the acquisition paths of French consonants and vowels. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;This contribution aims to understand how a sequential &lt;i&gt;pattern mining&lt;/i&gt; algorithm can help us explore a large base of longitudinal follow-ups that would otherwise be impossible to process manually. The speed and modularity of this algorithm could provide the basis for understanding which factors matter most in the acquisition of phonemes, among place of articulation, manner of articulation, and the frequency of occurrence of a target in parental input. It is indeed difficult to quantify precisely the relative weight of these factors.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
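&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;As a rough illustration of the idea, the sketch below counts in how many productions each contiguous run of segments occurs and keeps those above a support threshold. This is a deliberately restricted form of sequential pattern mining (contiguous patterns only), written in plain Python; it is not the algorithm or the API of any particular library, and the function name &lt;i&gt;frequent_patterns&lt;/i&gt; and its parameters are hypothetical.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
from collections import Counter


def frequent_patterns(sequences, min_support, max_len=3):
    """Return contiguous patterns found in at least `min_support` sequences.

    Each sequence is a string of segments (one character per segment here).
    A pattern is counted once per sequence, however often it repeats inside it.
    """
    support = Counter()
    for seq in sequences:
        seen = set()
        for n in range(1, max_len + 1):
            for start in range(len(seq) - n + 1):
                seen.add(tuple(seq[start:start + n]))
        support.update(seen)
    return {pattern: count for pattern, count in support.items()
            if count >= min_support}


# Four child productions of the target word 'tracteur', used as toy input.
productions = ["taktœʁ", "saktœʁ", "ʁaktœʁ", "tatø"]
patterns = frequent_patterns(productions, min_support=3)

for pattern, count in sorted(patterns.items(), key=lambda item: (-len(item[0]), item[0])):
    print("".join(pattern), count)
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Patterns that survive the threshold point to segment runs that stay stable across productions; what falls outside them is where the variation concentrates.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;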

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The CoLaJE corpus, through the Ortolang digital platform, already offers a valuable query tool that helps target specific words and also accepts regular expressions&lt;a href=&quot;#_ftn1&quot; name=&quot;_ftnref1&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:11.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;. 
The results returned by this &lt;i&gt;query &lt;/i&gt;served as the starting point; a more detailed analysis was then carried out using the Python library &amp;ldquo;pylangacq&amp;rdquo;&lt;a href=&quot;#_ftn2&quot; name=&quot;_ftnref2&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:11.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[2]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; (Lee et al., 2016), together with the set of algorithms available in another Python library called &amp;ldquo;pymining&amp;rdquo;&lt;a href=&quot;#_ftn3&quot; name=&quot;_ftnref3&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:11.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[3]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The examples below were chosen for their representativeness according to several criteria: the presence of several consonant clusters, having at least two syllables, the presence of consonants that are acquired relatively late (/ʁ/, for example), and their high frequency in the corpus in question (that is, several different occurrences at different ages for several different children, which would lay the groundwork for a possible generalisation of a typical path).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Here are two example applications:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;i&gt;First example&lt;/i&gt;: for the target word &amp;lsquo;tracteur&amp;rsquo;, /tʁakt&amp;oelig;ʁ/, whose syllable structure is CCVCCVC, we list all the phonetic/phonological variations observed in the transcriptions of the children of the CoLaJE project (the numbers give the child&amp;rsquo;s age as year_month_day):&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁikt&amp;oelig;ʁ/ Antoine 2_02_27&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁat&amp;oelig;ʁ/ Antoine 2_02_27 &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁat&amp;oelig;ʁ/ Antoine 2_03_05&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁak&amp;oelig;ʁ/ Antoine 2_04_03&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tatoʁ/ Th&amp;eacute;ophile 2_10_28&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/taktɔ/ Adrien 3_09_09&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/toktɔʁ/ Adrien 4_00_15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/taktɔʁ/ Adrien 4_00_15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/takt&amp;oelig;ʁ/ Adrien 4_02_15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/takt&amp;oelig;ʁ/ Adrien 4_02_15&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/sakt&amp;oelig;ʁ/ Julie 1_06_04 (BRO)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/ʁakt&amp;oelig;ʁ/ Julie 1_07_26&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tat&amp;oslash;/ Julie 1_07_26&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tʁakt&amp;oelig;ʁ/ Julie 2_09_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tʁakt&amp;oelig;ʁ/ Julie 2_09_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;We observe that the productions around the norm (or phonetic/phonological target) /tʁakt&amp;oelig;ʁ/ vary with age and from child to child.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
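&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;One simple way to put a number on this observation is sketched below: the productions listed above are grouped by child, ordered by age, and compared with the target using a segment-level edit distance. The distance measure, the approximate age conversion (365-day years, 30-day months) and all function names are assumptions introduced for illustration, not the method used in the study.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
from collections import defaultdict

TARGET = "tʁaktœʁ"

# (production, child, age as year_month_day), copied from the list above.
OBSERVATIONS = [
    ("kʁiktœʁ", "Antoine", "2_02_27"), ("kʁatœʁ", "Antoine", "2_02_27"),
    ("kʁatœʁ", "Antoine", "2_03_05"), ("kʁakœʁ", "Antoine", "2_04_03"),
    ("tatoʁ", "Théophile", "2_10_28"),
    ("taktɔ", "Adrien", "3_09_09"), ("toktɔʁ", "Adrien", "4_00_15"),
    ("taktɔʁ", "Adrien", "4_00_15"), ("taktœʁ", "Adrien", "4_02_15"),
    ("taktœʁ", "Adrien", "4_02_15"),
    ("saktœʁ", "Julie", "1_06_04"), ("ʁaktœʁ", "Julie", "1_07_26"),
    ("tatø", "Julie", "1_07_26"), ("tʁaktœʁ", "Julie", "2_09_24"),
    ("tʁaktœʁ", "Julie", "2_09_24"),
]


def age_in_days(age):
    """Convert 'year_month_day' to days (approximation for sorting only)."""
    years, months, days = (int(part) for part in age.split("_"))
    return years * 365 + months * 30 + days


def edit_distance(produced, target):
    """Levenshtein distance counted in segments (one character per segment)."""
    previous = list(range(len(target) + 1))
    for i, seg_p in enumerate(produced, 1):
        current = [i]
        for j, seg_t in enumerate(target, 1):
            current.append(min(previous[j] + 1,                       # extra segment
                               current[j - 1] + 1,                    # missing segment
                               previous[j - 1] + (seg_p != seg_t)))   # substitution
        previous = current
    return previous[-1]


def distances_by_child(observations, target):
    """Map each child to the chronological list of (age, form, distance)."""
    grouped = defaultdict(list)
    for form, child, age in observations:
        grouped[child].append((age, form, edit_distance(form, target)))
    for rows in grouped.values():
        rows.sort(key=lambda row: age_in_days(row[0]))  # stable: ties keep list order
    return dict(grouped)


trajectories = distances_by_child(OBSERVATIONS, TARGET)
for child, rows in trajectories.items():
    print(child, [distance for _, _, distance in rows])
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;A distance of 0 marks an on-target production; a sequence that decreases with age gives a coarse picture of one child&amp;rsquo;s path towards the target, without saying anything yet about the type of variation involved.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;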

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;i&gt;Second example&lt;/i&gt;: target word &amp;lsquo;crayon&amp;rsquo;, /kʁɛjɔ̃/, syllable structure CCVCV&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Antoine 2_06_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁejɔ̃/ Antoine 2_06_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁejɔ̃/ Antoine 2_06_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tʁɛjɔ̃/ Antoine 2_06_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kɛʁejɔ̃/ Th&amp;eacute;ophile 3_02_00&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/crejɔ̃ː/ Th&amp;eacute;ophile 3_04_10&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Th&amp;eacute;ophile 3_05_11&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Th&amp;eacute;ophile 3_07_08&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Th&amp;eacute;ophile 4_03_29&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Th&amp;eacute;ophile 4_09_07&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kijo / Adrien 4_01_12&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Julie 2_03_08&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Julie 2_11_01&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Julie 3_04_21&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛʒjɔ̃/ Julie 3_04_21&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁɛjɔ̃/ Julie 3_04_21&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kejɔ̃/ Anae 2_00_26&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tejɔ̃/ Anae 2_00_26&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tijɔ̃/ Anae 2_00_26&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tijɔ̃/ Anae 2_00_26&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁejɔ̃/ Anae 2_06_27&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/jʁajɔ̃/ Anae 2_08_24&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁejɔ̃/ Anae 5_10_30&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;What Ana&amp;euml; and Julie have in common is that, around the age of two and a half to three, they seem to have learned the correct form of the target word once and for all, since they manage to articulate it correctly at successive time intervals. Yet at later ages they produce a variation that they had never produced before (to be more precise, it could also be a variation that was simply not captured, given the sampling density of one hour per month planned by the CoLaJE project; see Yamaguchi, 2018). This phenomenon, although counter-intuitive, is fairly common in L1 acquisition (Sauvage, 2015, p. 125, in particular the concept of &amp;lsquo;regression&amp;rsquo;).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
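&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;A minimal sketch of how such late variations could be flagged automatically: for one child, list every off-target production recorded after the first on-target one. Two points are assumptions made for this illustration: both /kʁɛjɔ̃/ and /kʁejɔ̃/ are counted as on-target, and the timeline is taken to be in chronological order as in the lists above. The function name is hypothetical.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
O_NASAL = "ɔ\u0303"  # nasal vowel written as base vowel + combining tilde

# Assumption: both vowel qualities in the first syllable count as on-target.
ACCEPTED = {"kʁɛj" + O_NASAL, "kʁej" + O_NASAL}

# (age, production) pairs in chronological order, copied from the lists above.
JULIE = [
    ("2_03_08", "kʁɛj" + O_NASAL), ("2_11_01", "kʁɛj" + O_NASAL),
    ("3_04_21", "kʁɛj" + O_NASAL), ("3_04_21", "kʁɛʒj" + O_NASAL),
    ("3_04_21", "kʁɛj" + O_NASAL),
]
ANAE = [
    ("2_00_26", "kej" + O_NASAL), ("2_00_26", "tej" + O_NASAL),
    ("2_00_26", "tij" + O_NASAL), ("2_00_26", "tij" + O_NASAL),
    ("2_06_27", "kʁej" + O_NASAL), ("2_08_24", "jʁaj" + O_NASAL),
    ("5_10_30", "kʁej" + O_NASAL),
]


def late_variations(timeline, accepted):
    """Return off-target productions recorded after the first on-target one."""
    reached_target = False
    flagged = []
    for age, form in timeline:
        if form in accepted:
            reached_target = True
        elif reached_target:
            flagged.append((age, form))
    return flagged


print(late_variations(JULIE, ACCEPTED))
print(late_variations(ANAE, ACCEPTED))
```

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Because of the sampling density, an empty result for a child would not prove that no such variation occurred, only that none was recorded.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;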

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;La proc&amp;eacute;dure pour rep&amp;eacute;rer et analyser les variations est la suivante&amp;nbsp;: &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;i) search for the target word using the &lt;i&gt;query &lt;/i&gt;tool of the Ortolang project&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;ii) access the transcriptions of the CoLaJE children through the &amp;lsquo;pylangacq&amp;rsquo; library&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;iii) set up an &amp;lsquo;if-then&amp;rsquo; algorithm that checks whether the pronounced word differs from the target word.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-bottom:11px; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;iv) If it does, detect its syllabic structure via &amp;lsquo;pymining&amp;rsquo;. If it does not, processing of that line ends there.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
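&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Steps iii) and iv) can be sketched in a few lines of Python. The sketch below is only an illustration under stated assumptions: forms are given as lists of IPA segments, the vowel inventory VOWELS and the function names are hypothetical, and a plain consonant/vowel skeleton stands in for the syllabic structure. It does not reproduce the &amp;lsquo;pylangacq&amp;rsquo; or &amp;lsquo;pymining&amp;rsquo; calls used in the study.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
# [ɑ̃] written with an explicit escape so that it is always one identical segment.
NASAL_A = "\u0251\u0303"

# Hypothetical, deliberately small vowel inventory (an assumption, not the study's).
VOWELS = {"a", "e", "i", "o", "u", "y", "ɔ", "œ", "ø", "ɛ", "ə", NASAL_A}


def cv_skeleton(segments):
    """Map each segment to 'V' (listed vowel) or 'C' (anything else)."""
    return "".join("V" if seg in VOWELS else "C" for seg in segments)


def analyse(target, produced):
    """Step iii: compare the two forms; step iv: describe the produced form if it differs."""
    if produced == target:
        return None  # no variation: nothing more to do for this line
    return cv_skeleton(produced)


# Example from the text: toboggan produced as [togobɑ̃].
# The adult form [tobogɑ̃] is assumed here as the target.
target = ["t", "o", "b", "o", "g", NASAL_A]
produced = ["t", "o", "g", "o", "b", NASAL_A]
print(analyse(target, produced))  # CVCVCV
print(analyse(target, target))    # None
```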

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The hardest part is defining the variation: once a variation has been detected, the machine would have to be taught to assign it to one of the categories below, which in turn rest on several criteria (place and manner of articulation, voicing, syllable order): &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;1) Omission&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;2) Substitution&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;3) Assimilation&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;4) Reduction&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;5) Duplication&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;6) Epenthesis&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-18pt; margin-bottom:11px; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;7) Metathesis&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
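&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;To give an idea of what a first, very coarse pass over these categories might look like, here is a hedged Python sketch. It only separates omission, epenthesis and single-segment substitution by comparing segment lists; assimilation, reduction, duplication and metathesis would require phonological features and are left as &amp;lsquo;unclassified&amp;rsquo; (a duplication would even be labelled as an epenthesis here). The function names and the toy forms are hypothetical and are not CoLaJE data.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
def is_subsequence(short, long):
    """True if `short` can be obtained from `long` by deleting segments only."""
    remaining = iter(long)
    return all(seg in remaining for seg in short)


def coarse_label(target, produced):
    """First-pass label based on length and position only (no phonological features)."""
    if produced == target:
        return "match"
    if len(produced) < len(target) and is_subsequence(produced, target):
        return "omission"
    if len(produced) > len(target) and is_subsequence(target, produced):
        return "epenthesis"
    if len(produced) == len(target):
        diffs = [i for i, (t, p) in enumerate(zip(target, produced)) if t != p]
        if len(diffs) == 1:
            return "substitution"
    return "unclassified"


# Toy forms for illustration only (hypothetical, one character per segment).
toy_target = list("tobo")
for toy_produced in ("tobo", "tbo", "toboo", "topo", "tboo"):
    print(toy_produced, coarse_label(toy_target, list(toy_produced)))
```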

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;This attempt stopped at the syllabic structure, because programming the part concerning the 7 possible phonological variations proved difficult: too many variables and too many sequential steps were involved. For example, once a substitution had been detected, a way would also have been needed to classify that substitution according to the phoneme replaced: replacing fricatives with stops is not equivalent to replacing liquids with semivowels. An even more complex example: in assimilation, two sounds become similar in place of articulation, manner of articulation or voicing, but it would clearly not be rigorous to treat these three criteria as equal. A hierarchy might have been needed, but which one?&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;A final example concerns metatheses, where the main stumbling block was their number and variety: a&amp;eacute;roport &amp;rarr; [aʁeopɔʁ] is not identical to the following case, toboggan &amp;rarr; [togobɑ̃].&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;In the first case the metathesis involves a consonant and a vowel; in the second, two consonants.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
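&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The distinction between these two cases can be illustrated with a small Python sketch. It is a simplification under stated assumptions: it only recognises a swap of exactly two segments in forms of equal length, the vowel inventory and the function name are hypothetical, and the adult forms [aeʁopɔʁ] and [tobogɑ̃] are assumed as targets.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
NASAL_A = "\u0251\u0303"  # [ɑ̃] kept as one segment

# Hypothetical vowel inventory, sufficient for the two examples only.
VOWELS = {"a", "e", "i", "o", "u", "y", "ɔ", "œ", "ø", "ɛ", "ə", NASAL_A}


def find_metathesis(target, produced):
    """Return (i, j, kind) if exactly two segments have swapped places, else None.

    kind is 'CC', 'CV' or 'VV' depending on the two segments involved.
    """
    if len(target) != len(produced):
        return None
    diffs = [i for i, (t, p) in enumerate(zip(target, produced)) if t != p]
    if len(diffs) != 2:
        return None
    i, j = diffs
    if target[i] != produced[j] or target[j] != produced[i]:
        return None  # two substitutions, not a swap
    kinds = sorted("V" if target[k] in VOWELS else "C" for k in (i, j))
    return i, j, "".join(kinds)


aeroport = find_metathesis(
    ["a", "e", "ʁ", "o", "p", "ɔ", "ʁ"],  # assumed adult form
    ["a", "ʁ", "e", "o", "p", "ɔ", "ʁ"],  # form quoted in the text
)
toboggan = find_metathesis(
    ["t", "o", "b", "o", "g", NASAL_A],   # assumed adult form
    ["t", "o", "g", "o", "b", NASAL_A],   # form quoted in the text
)
print(aeroport)  # (1, 2, 'CV')
print(toboggan)  # (2, 4, 'CC')
```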

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;A further difficulty could be added: the variations linked to the phonological processes listed above can occur at the beginning, in the middle or at the end of a word, and they can affect a single consonant or vowel, or a whole syllable.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;While reflecting on the multiplicity of these variations, certain questions kept recurring: since the variations differ in nature, should a different weight be assigned according to the nature of the variation? What criteria could be adopted to assign this weight?&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Unfortunately, it was not possible to take all these possible variations into account: too many competing factors are at play, and the author&amp;rsquo;s skills are not equal to so complex a task. Nevertheless, some studies have undertaken analogous work. For example, the neural network that takes both phonetic and phonological aspects into account, proposed by Paul Boersma, the creator of the PRAAT software (Boersma et al., 2020), suggests avenues that could answer the questions above.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;It is clearly difficult to identify a typical trajectory from these variations: their number and variety are too large. The first obstacle is purely statistical and concerns the relationship between sample and population: unfortunately, there was no way to obtain an occurrence of each word for each monthly recording and for each child in the CoLaJE corpus. Even the most frequent words can sometimes be missing, especially at the youngest ages, when children speak relatively little. The second obstacle is understanding why one variation occurred rather than another. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;For example, why /toktɔʁ/ (Adrien 4_00_15) and /taktɔʁ/ (Adrien 4_00_15)? It is hard to believe that a four-year-old child cannot perceive and articulate the difference between the vowels /o/ and /a/.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The third obstacle is interpreting the cause of the variation, that is, the motivations that led a child to pronounce one variation rather than another: for example, an avoidance strategy that leads children to omit or reduce a target consonant that demands too much effort, as in the following case:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/tatoʁ/ Th&amp;eacute;ophile 2_10_28&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;or an assimilation, which leads a child to prefer syllable sequences that share a place of articulation, as in the case below:&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;/kʁak&amp;oelig;ʁ/ Antoine 2_04_03.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;After several combinations of algorithms had been tried on several different words, the limits of the computational approach became apparent. It is only possible to confirm acquisition trends already established in the existing literature (Dos Santos, 2007; Yamaguchi, 2012), such as the order of acquisition of vowels or consonants, and which variations are most and least frequent. As for prediction with an acceptable degree of accuracy, it proved difficult to model the sequences of variations over time: which variation will follow, given the two preceding ones? This question remains unanswered.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Combining algorithms proved unfruitful, because &lt;i&gt;intra&lt;/i&gt;-child and &lt;i&gt;inter&lt;/i&gt;-child variability is too great. Another possible avenue would be to focus on a narrower topic, for example exploring variations in analogous syllable types such as stop-liquid clusters. One could start by drawing up a sufficiently representative list of words containing this type of syllable and proceed step by step (cf. the 4 steps listed above). This focus should considerably reduce the number of possible variations and thereby simplify the programming task. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;To conclude, these results show that it is in principle beneficial to model the many phonetic/phonological variations with NLP tools: although the variations are multiform in nature and numerous, they can be included in a single model that could account for the rules governing the possible paths of their evolution. As already stated, the results presented in this study have only anecdotal value: they agree overall with case studies conducted on the same corpus (Yamaguchi, 2012) or on other French-speaking children recorded with comparable methods (Dos Santos, 2007).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;This section presents part of the extensive mining and modelling work on the CoLaJE corpus produced through a collaboration between linguists and computer scientists at Universit&amp;eacute; &amp;laquo;&amp;nbsp;Paul Val&amp;eacute;ry&amp;nbsp;&amp;raquo; Montpellier (more precisely, the &lt;i&gt;data scientists&lt;/i&gt; of the MIASHS master&amp;rsquo;s programme supervised by S. Bringay) during the 2019-2020 academic year. For an overview of this work, please follow the link in the footnote&lt;a href=&quot;#_ftn4&quot; name=&quot;_ftnref4&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span lang=&quot;FR&quot; style=&quot;font-size:11.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[4]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;


&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;b&gt;Conclusions&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The aim of this article was to reflect on the use of NLP models and techniques to highlight the relationship between norm and variation in the acquisition of French as a first language. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Two case studies were presented. In the first, variation had a twofold articulation, at the lexical level and at the level of the child&amp;rsquo;s vocabulary. The results of a previous study (Briglia et al., 2022) showed how creating a unified model of the word category (conceived as a unit made up of three constituents: signifier, signified, referent) makes it possible to group several phonetic/phonological variations under a single category, in order to facilitate the analysis of another type of variation, that of the exponent &lt;i&gt;alpha&lt;/i&gt;, an index representing how the frequency distribution of words in the child&amp;rsquo;s vocabulary varies: &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;i) over time (intra-child) &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;ii) between children (inter-child)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;text-indent:-36pt; margin-bottom:11px; margin-left:48px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;iii) between the children and their respective parents (Spearman correlation). &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;For this last analysis, the coding of the CHI-FAT-MOT transcriptions, the development of criteria for unifying the variations into a single set, and the computation of occurrence frequencies and correlations were all carried out automatically in Python.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
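&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;As a rough illustration of the two quantities involved, the Python sketch below estimates an exponent &lt;i&gt;alpha&lt;/i&gt; by a least-squares fit on the log-log rank-frequency curve and computes a Spearman correlation as the Pearson correlation of ranks. Both functions and their names are assumptions made for this example; the estimation procedure actually used in the cited study may differ.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

```python
import math


def zipf_alpha(frequencies):
    """Least-squares slope of log(frequency) against log(rank), with the sign flipped."""
    freqs = sorted(frequencies, reverse=True)
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return -slope


def average_ranks(values):
    """Ranks starting at 1; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        for k in range(start, end + 1):
            ranks[order[k]] = (start + end) / 2 + 1
        start = end + 1
    return ranks


def spearman(xs, ys):
    """Spearman's rho: Pearson correlation computed on the ranks of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Synthetic check: frequencies that follow 1/rank exactly give alpha close to 1.
synthetic = [1000 / rank for rank in range(1, 11)]
print(round(zipf_alpha(synthetic), 6))
print(spearman([1, 2, 3, 4], [10, 20, 30, 40]))
```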

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;In the second example, we saw how fast sequential pattern mining algorithms are and how taking into account all the phonetic-phonological variations around the adult norm is theoretically feasible, even though in practice it is difficult to give the right place and the right weight to the different criteria that define the variations. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;The application of models, techniques and frameworks from computer science to linguistics is growing and allows hypotheses to be tested reliably, reproducibly and quickly. Moreover, most software for the analysis of corpora (AntConc, TXM, Iramuteq), speech (PRAAT, PHON) or gesture (ELAN) is freely available and &lt;i&gt;open source&lt;/i&gt;, which is a real advantage. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Despite these advantages, the adoption of NLP techniques must not be seen as a master key that dispenses with in-depth knowledge of the language itself or of the linguistic phenomenon (L1 acquisition, for example). Speed and computing power must be guided by assumptions, hypotheses and theoretical frameworks that, as of today, only human intelligence can master.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Other NLP tools developed within the French-speaking community that could be used to assess the acquisition of French as a first language in children are iPhocomp (Lee et al., 2014) and the ISC (syntactic complexity index, Szmrecsanyi, 2004). Indeed, when longitudinal follow-ups are available in several different formats, as with the CoLaJE corpus, it becomes possible to obtain a score for each word and/or utterance produced by the child by automating, in a programming language such as Python, the computation of these scores for each coded line, whether it is the CHI, pho or mod line and whatever its format (csv, CHAT or TEI, to mention only the formats available on CoLaJE-Ortolang). A recent study has shown the validity of using these two scores to predict the acquisition of certain grammatical categories in a case study drawn from the CoLaJE corpus (Briglia et al., 2022).&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;In recent years, BERT (Bidirectional Encoder Representations from Transformers), which appears to be the most complete and comprehensive NLP technology, has been improved (in terms of performance for French) by taking into account the particularities of the target language. This is how CamemBERT (Martin et al., 2020) came into being.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;One might fear that this steadily growing presence of computer science in a field of investigation that has traditionally belonged to linguistics will, sooner or later, lead to the downgrading of the latter. These fears are plausible, yet it should be noted that no system for automatic part-of-speech annotation (POS tagging), text classification or word embedding, up to the latest machine learning technologies (BERT or, more generally, neural networks, whether supervised or not), can be designed without prior linguistic knowledge. Moreover, although artificial intelligence is ever more refined in its predictions and inferences about language, it has recurring problems with coarticulation (&lt;i&gt;speech-to-text&lt;/i&gt; and &lt;i&gt;text-to-speech&lt;/i&gt; technologies), synonymy and polysemy, and meaning in context (&lt;i&gt;i.e.&lt;/i&gt; the pragmatic level). 
In other words, everything that involves understanding different accents or different senses, style, nuance, context-dependent variation, ambiguity or implicit meaning remains particularly difficult terrain for machines. Flexibility and creativity seem set to remain skills better mastered by human intelligence.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;These differences show how a synergy between linguists and computer scientists could form the core of much future research in the field of language.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;b&gt;Bibliography&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&amp;nbsp;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Ambridge, B., Kidd, E., Rowland, C. F., &amp;amp; Theakston, A. L. (2015). The ubiquity of frequency effects in first language acquisition. &lt;i&gt;Journal of child language&lt;/i&gt;, &lt;i&gt;42&lt;/i&gt;(2), 239-273&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Baixeries J., Elvevag B. and Ferrer-i-Cancho R. (2013). The Evolution of the Exponent of Zipf&amp;rsquo;s Law in&lt;br /&gt;
Language Ontogeny. PLoS ONE 8(3): e53227&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Bates, E., Dale, P. S., &amp;amp; Thal, D. (1995). &lt;i&gt;Individual differences and their implications for theories of language development&lt;/i&gt;. The handbook of child language, 30, 96-151&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Boersma, P., Benders, T., &amp;amp; Seinhorst, K. (2020). Neural network models for phonology and phonetics. &lt;i&gt;Journal of Language Modelling&lt;/i&gt;, &lt;i&gt;8&lt;/i&gt;(1), 103-177&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Briglia A. &amp;ldquo;Statistical and computational approaches to first language acquisition. Mining a set of French longitudinal corpora (CoLaJE)&amp;rdquo;. Th&amp;egrave;se Universit&amp;eacute; Paul Val&amp;eacute;ry Montpellier 3; Universit&amp;agrave; di Messina. 2021. &lt;a href=&quot;https://hal.archives-ouvertes.fr/tel-03319126&quot; style=&quot;color:blue; text-decoration:underline&quot; target=&quot;_blank&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Cambria Math&amp;quot;,serif&quot;&gt;&amp;lang;&lt;/span&gt;tel-03319126&lt;span style=&quot;font-family:&amp;quot;Cambria Math&amp;quot;,serif&quot;&gt;&amp;rang;&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Briglia A., Mucciardi M., Pirrotta G. (2022). &amp;ldquo;A statistical model for predicting child language acquisition: unfolding qualitative grammatical development by using logistic regression model&amp;rdquo;. In Salvati N., Perna C., Marchetti S., Chambers R. &amp;ldquo;Studies in Theoretical and Applied Statistics&amp;rdquo;. Springer Proceedings in Mathematics &amp;amp; Statistics. PROMS, volume 406. SIS 2021, Pisa.&amp;nbsp; &lt;i&gt;in press&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Briglia A., Mucciardi M., Pirrotta G. &amp;ldquo;The development of word frequency distribution in first language acquisition. An analysis on a spoken language corpus of French children&amp;rdquo;. Vadistat Press. &lt;i&gt;Proceedings of the 16th International Conference on Statistical Analysis of Textual Data (JADT)&lt;/i&gt;, 1 (16)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Clements, G. N. (1985). The geometry of phonological features. &lt;i&gt;Phonology Yearbook&lt;/i&gt;, 2, 225-252&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Dos Santos C. (2007). D&amp;eacute;veloppement phonologique en fran&amp;ccedil;ais langue maternelle : une &amp;eacute;tude de cas. PhD thesis, Universit&amp;eacute; Lumi&amp;egrave;re Lyon 2&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Dubois, J., Marcellesi, J-B., M&amp;eacute;vel, J-P. &amp;amp; Giacomo, M. (1973). &lt;i&gt;Dictionnaire de linguistique&lt;/i&gt;. Paris : Larousse&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Goodman J., Dale P. and Li P. (2008). &lt;i&gt;Does frequency count? Parental input and the acquisition of vocabulary&lt;/i&gt;. Journal of Child Language, 35(03), 515&amp;ndash;531&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Hickmann M.; Veneziano E.; Jisa H. (Eds) (2018). Sources of Variation in First Language Acquisition. Languages, contexts and learners. John Benjamins&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;a name=&quot;_Hlk120873323&quot;&gt;Hoedt, A., &amp;amp; Piron, J. (2016). &lt;/a&gt;&lt;i&gt;La faute de l&amp;rsquo;orthographe&lt;/i&gt;. Paris : Textuel&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Jamila Sebbar Barge. Pour une nouvelle conception de la &amp;quot;norme&amp;quot; linguistique dans l&amp;#39;enseignement des langues. &lt;a href=&quot;https://hal.archives-ouvertes.fr/hal-00385090v2&quot; style=&quot;color:blue; text-decoration:underline&quot; target=&quot;_blank&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Cambria Math&amp;quot;,serif&quot;&gt;&amp;lang;&lt;/span&gt;hal-00385090v2&lt;span style=&quot;font-family:&amp;quot;Cambria Math&amp;quot;,serif&quot;&gt;&amp;rang;&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Jusoh, S. (2018). A study on NLP applications and ambiguity. &lt;i&gt;Journal of Theoretical &amp;amp; Applied Information Technology&lt;/i&gt;, &lt;i&gt;96&lt;/i&gt;(6)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Kraif O., Ponton C. (2007). Du bruit, du silence et des ambigu&amp;iuml;t&amp;eacute;s : que faire du TAL pour l&amp;rsquo;apprentissage des langues ? In &lt;i&gt;Actes de la 14&amp;egrave;me conf&amp;eacute;rence sur le Traitement Automatique des Langues Naturelles&lt;/i&gt;. Posters, pages 143&amp;ndash;152, Toulouse, France. ATALA&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Lee, H., Gambette, P., Barkat-Defradas, M. (2014). iPhocomp: calcul automatique de l&amp;rsquo;indice de complexit&amp;eacute; phon&amp;eacute;tique de Jakielski. &lt;i&gt;Actes de la XXXe &amp;eacute;dition des Journ&amp;eacute;es d&amp;#39;Etudes sur la Parole (JEP 2014)&lt;/i&gt;, Le Mans, France, pp. 622-630&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Lee, Jackson L., Ross Burkholder, Gallagher B. Flinn, and Emily R. Coppess. (2016). Working with CHAT transcripts in Python. &lt;i&gt;Technical report TR-2016-02&lt;/i&gt;, Department of Computer Science, University of Chicago.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Lestrade S. (2017). Unzipping Zipf&amp;rsquo;s law. &lt;i&gt;PLoS ONE&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;MacWhinney, B. (2000). The CHILDES Project: Tools for Analyzing Talk, Volume II: The Database (3rd ed.). &lt;i&gt;Psychology Press&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Martel, K., &amp;amp; Dodane, C. (2012). Le r&amp;ocirc;le de la prosodie dans les premi&amp;egrave;res constructions grammaticales : &amp;eacute;tude de cas d&amp;#39;un enfant fran&amp;ccedil;ais monolingue. &lt;i&gt;Journal of French Language Studies&lt;/i&gt;, &lt;i&gt;22&lt;/i&gt;(1), 13-35&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Martin, L., Muller, B., Su&amp;aacute;rez, P. J. O., Dupont, Y., Romary, L., de la Clergerie, &amp;Eacute;. V., Sagot, B. (2020). CamemBERT: a Tasty French Language Model. In &lt;i&gt;ACL 2020-58th Annual Meeting of the Association for Computational Linguistics&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Morgenstern A.; Parisse C. (2012). The Paris Corpus. &lt;i&gt;Journal of French Language Studies&lt;/i&gt;, &lt;i&gt;22&lt;/i&gt;, 7-12. Cambridge University Press. Special Issue&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Mucciardi M., Pirrotta G., Briglia A., Sallaberry A. (2021). Visualizing cluster of words: a graphical approach to grammar acquisition. In Giovanni C. Porzio; Carla Rampichini; Chiara Bocci (Eds). &lt;i&gt;CLADAG 2021 BOOK OF SHORT PAPERS. &lt;/i&gt;&lt;i&gt;13th Scientific Meeting of the Classification and Data Analysis Group - &lt;/i&gt;Firenze University Press, pp.392-395&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Piantadosi S. (2014). Zipf&amp;rsquo;s word frequency law in natural language: A critical review and future directions. &lt;i&gt;Psychonomic Bulletin &amp;amp; Review&lt;/i&gt;, 21(5), 1112&amp;ndash;1130&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Prince, A., Smolensky P. (2004). Optimality Theory: Constraint Interaction in Generative Grammar. &lt;i&gt;Blackwell Publishers&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Sauvage J. (2015). L&amp;rsquo;acquisition du langage : un syst&amp;egrave;me complexe. &lt;i&gt;L&amp;rsquo;Harmattan&lt;/i&gt;, Louvain&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Siouffi, G., &amp;amp; Steuckardt, A. (Eds). (2007). &lt;i&gt;Les linguistes et la norme&lt;/i&gt;. Berne : Peter Lang&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Srikant R., Agrawal R. (1996). Mining Sequential Patterns: Generalizations and Performance Improvements. &lt;i&gt;Proceedings of the 5th International Conference on Extending Database Technology (EDBT&amp;rsquo;96)&lt;/i&gt;. Avignon. France. p. 3-1&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Szmrecsanyi, B. (2004). On operationalizing syntactic complexity, in: Purnelle, G&amp;eacute;rard, C&amp;eacute;drick Fairon and Anne Dister (eds.), Le poids des mots. &lt;i&gt;Proceedings of the 7th International Conference on Textual Data Statistical Analysis. &lt;/i&gt;&lt;i&gt;Vol. 2&lt;/i&gt;. Louvain-la-Neuve, Presses Universitaires de Louvain. &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Thomson, J. R., &amp;amp; Chapman, R. S. (1977). Who is daddy revisited: The status of two-year-olds&amp;#39; over-extended words in use and comprehension. &lt;i&gt;Journal of Child Language, 4&lt;/i&gt;(3), 359&amp;ndash;375&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Tomasello, M., &amp;amp; Stahl, D. (2004). Sampling children&amp;#39;s spontaneous speech: How much is enough? &lt;i&gt;Journal of Child Language&lt;/i&gt;, &lt;i&gt;31&lt;/i&gt;(1), 101-121&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Vihman, M. M. and McCune L. (1994). When is a word a word? &lt;i&gt;Journal of Child Language&lt;/i&gt;, 21(3), 517&amp;ndash;542&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Yamaguchi N. (2012). Parcours d&amp;rsquo;acquisition des sons du langage chez deux enfants francophones. PhD thesis, Sorbonne Nouvelle University (Paris 3)&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Yamaguchi N. (2018). What is a representative language sample for word and sound acquisition? &lt;i&gt;Revue canadienne de linguistique&lt;/i&gt;. University of Toronto Press. 63 (04), pp.667-685&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p style=&quot;margin-bottom:11px&quot;&gt;&lt;span style=&quot;font-size:11pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;Zipf G.K. (1949). Human Behavior and the Principle of Least Effort. &lt;i&gt;Addison-Wesley.&lt;/i&gt; Cambridge (MA), USA&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;


&lt;div&gt;&amp;nbsp;
&lt;hr align=&quot;left&quot; size=&quot;1&quot; width=&quot;33%&quot; /&gt;
&lt;div id=&quot;ftn1&quot;&gt;
&lt;p class=&quot;MsoFootnoteText&quot;&gt;&lt;span style=&quot;font-size:10pt&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;a href=&quot;#_ftnref1&quot; name=&quot;_ftn1&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span style=&quot;font-size:10.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; &lt;a href=&quot;https://ct3xq.ortolang.fr/ct3xq/interro&quot; style=&quot;color:blue; text-decoration:underline&quot;&gt;https://ct3xq.ortolang.fr/ct3xq/interro&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;div id=&quot;ftn2&quot;&gt;
&lt;p class=&quot;MsoFootnoteText&quot;&gt;&lt;span style=&quot;font-size:10pt&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;a href=&quot;#_ftnref2&quot; name=&quot;_ftn2&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span style=&quot;font-size:10.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[2]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; &lt;a href=&quot;https://pylangacq.org/&quot; style=&quot;color:blue; text-decoration:underline&quot;&gt;https://pylangacq.org/&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;div id=&quot;ftn3&quot;&gt;
&lt;p class=&quot;MsoFootnoteText&quot;&gt;&lt;span style=&quot;font-size:10pt&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;a href=&quot;#_ftnref3&quot; name=&quot;_ftn3&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span style=&quot;font-size:10.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[3]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; &lt;a href=&quot;https://github.com/bartdag/pymining&quot; style=&quot;color:blue; text-decoration:underline&quot;&gt;https://github.com/bartdag/pymining&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;div id=&quot;ftn4&quot;&gt;
&lt;p class=&quot;MsoFootnoteText&quot;&gt;&lt;span style=&quot;font-size:10pt&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;a href=&quot;#_ftnref4&quot; name=&quot;_ftn4&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span style=&quot;font-size:10.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[4]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; &lt;a href=&quot;https://marine27.github.io/TER/index.html&quot; style=&quot;color:blue; text-decoration:underline&quot;&gt;https://marine27.github.io/TER/index.html&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;
&lt;/div&gt;


&lt;div&gt;&amp;nbsp;
&lt;hr align=&quot;left&quot; size=&quot;1&quot; width=&quot;33%&quot; /&gt;
&lt;div id=&quot;ftn1&quot;&gt;
&lt;p class=&quot;MsoFootnoteText&quot;&gt;&lt;span style=&quot;font-size:10pt&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;, sans-serif&quot;&gt;&lt;a href=&quot;#_ftnref1&quot; name=&quot;_ftn1&quot; style=&quot;color:blue; text-decoration:underline&quot; title=&quot;&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span class=&quot;MsoFootnoteReference&quot; style=&quot;vertical-align:super&quot;&gt;&lt;span style=&quot;font-size:10.0pt&quot;&gt;&lt;span style=&quot;line-height:107%&quot;&gt;&lt;span style=&quot;font-family:&amp;quot;Calibri&amp;quot;,sans-serif&quot;&gt;[1]&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/a&gt; Link to the exact point in the recording where the target word occurs (use the query to find other words): &lt;a href=&quot;https://ct3.ortolang.fr/tools/trjsbrowser/trjs.html?f=/data/colaje/adrien/ADRIEN-34-4_03_26/ADRIEN-34-4_03_26.tei_corpo.xml&amp;amp;m=/data/colaje/adrien/ADRIEN-34-4_03_26/ADRIEN-34-4_03_26-480p.mp4&amp;amp;time=1380.0&amp;amp;nowave&quot; style=&quot;color:blue; text-decoration:underline&quot;&gt;https://ct3.ortolang.fr/tools/trjsbrowser/trjs.html?f=/data/colaje/adrien/ADRIEN-34-4_03_26/ADRIEN-34-4_03_26.tei_corpo.xml&amp;amp;m=/data/colaje/adrien/ADRIEN-34-4_03_26/ADRIEN-34-4_03_26-480p.mp4&amp;amp;time=1380.0&amp;amp;nowave&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;
&lt;/div&gt;

&lt;p style=&quot;margin-bottom: 11px; text-align: center;&quot;&gt;&amp;nbsp;&lt;/p&gt;