{"id":7786,"date":"2015-11-06T17:00:43","date_gmt":"2015-11-06T16:00:43","guid":{"rendered":"https:\/\/sciencetonnante.wordpress.com\/?p=7786"},"modified":"2015-11-06T17:00:43","modified_gmt":"2015-11-06T16:00:43","slug":"la-machine-a-inventer-des-mots-version-ikea","status":"publish","type":"post","link":"https:\/\/scienceetonnante.com\/blog\/2015\/11\/06\/la-machine-a-inventer-des-mots-version-ikea\/","title":{"rendered":"La machine \u00e0 inventer des mots, version Ikea"},"content":{"rendered":"<p><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/ikea-flatpack-furniture-1.jpg\"><img decoding=\"async\" class=\"alignleft wp-image-7797 size-medium lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/ikea-flatpack-furniture-1.jpg?w=300\" alt=\"ikea-flatpack-furniture-1\" width=\"300\" height=\"217\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/ikea-flatpack-furniture-1.jpg 728w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/ikea-flatpack-furniture-1-300x217.jpg 300w\" data-sizes=\"(max-width: 300px) 100vw, 300px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 300px; --smush-placeholder-aspect-ratio: 300\/217;\" \/><\/a>Vous connaissez le canap\u00e9 S\u00f6ft ? La commode Utrad ? L&rsquo;\u00e9tag\u00e8re H\u00e5ng ? L&rsquo;armoire Muskydd ? Le mixeur Skymfor ? La po\u00eble Kukv\u00e4de ? Le placard Kl\u00f6stig ? Le circuit Rundering ? La table Oljulstad ? Les rideaux Lykof\u00e5tsly ? Le bureau H\u00e5kmanedfol ? La chaise Sj\u00e4rganskig ?<\/p>\n<p>Eh bien contrairement aux apparences, ces noms ne font pas partie du v\u00e9ritable catalogue Ikea ! <strong>Ils ont \u00e9t\u00e9 fabriqu\u00e9s automatiquement par un algorithme<\/strong> qui s&rsquo;inspire de vrais mots su\u00e9dois pour en cr\u00e9er de nouveaux, qui \u00ab\u00a0sonnent su\u00e9dois\u00a0\u00bb mais n&rsquo;existent pas dans cette langue.<!--more--><\/p>\n<p>Les habitu\u00e9s de ce blog auront reconnu la m\u00e9thode que j&rsquo;avais utilis\u00e9e il y a quelques semaines pour vous proposer ma machine \u00e0 inventer des mots, au sujet de laquelle j&rsquo;avais fait cette vid\u00e9o, que je vous remets pour ceux qui ne l&rsquo;ont pas vue<\/p>\n<p>[youtube=http:\/\/www.youtube.com\/watch?v=YsR7r2378j0]<\/p>\n<p>Suite \u00e0 la vid\u00e9o, j&rsquo;avoue avoir \u00e9t\u00e9 surpris par le nombre de personnes qui me demandaient le code source ! Il se trouve que je l&rsquo;avais mis en partage sur <a href=\"https:\/\/scienceetonnante.com\/blog\/2015\/10\/16\/la-machine-a-inventer-des-mots-video\/\">le billet pr\u00e9c\u00e9dent<\/a>, mais sans trop faire gaffe \u00e0 la qualit\u00e9 de ce qu&rsquo;il contenait.<\/p>\n<p>Vu l&rsquo;enthousiasme g\u00e9n\u00e9ral et suite \u00e0 un certain nombre de questions et suggestions, j&rsquo;ai d\u00e9cid\u00e9 de r\u00e9crire <a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/machine.py\" target=\"_blank\" rel=\"noopener\">une version un peu plus propre du code<\/a>, et de la partager avec vous.<\/p>\n<p>Parmi les nouveaut\u00e9s de ce code, j&rsquo;ai surtout inclus plus de langues, gr\u00e2ce \u00e0 une excellente source : <a href=\"http:\/\/cgit.freedesktop.org\/libreoffice\/dictionaries\/tree\/\" target=\"_blank\" rel=\"noopener\"><strong>les dictionnaires du package hunspell, <\/strong><\/a>merci \u00e0 Samuel pour cette id\u00e9e brillante ! (<a href=\"http:\/\/science-etonnante.com\/WordsMachine\/data.zip\" target=\"_blank\" rel=\"noopener\">ici l&rsquo;archive contenant les dictionnaires que j&rsquo;ai l\u00e9g\u00e8rement adapt\u00e9s<\/a>)<\/p>\n<p>Dans le cas du fran\u00e7ais, on peut faire la comparaison avec <a href=\"http:\/\/www.pallier.org\/ressources\/dicofr\/dicofr.html\" target=\"_blank\" rel=\"noopener\">le dictionnaire que j&rsquo;avais utilis\u00e9 pour faire ma vid\u00e9o<\/a>. En fait ce dernier n&rsquo;en \u00e9tait pas vraiment un puisqu&rsquo;il s&rsquo;agissait plut\u00f4t d&rsquo;un corpus de mots issus d&rsquo;une analyse automatique des livres du projet Gutenberg. Ce corpus contenait donc beaucoup de mots au pluriel, ou bien de verbes conjugu\u00e9s \u00e0 tous les temps, ce qui influen\u00e7ait pas mal les mots que j&rsquo;avais obtenu dans la vid\u00e9o. Autre diff\u00e9rence, les livres du projet Gutenberg \u00e9tant suppos\u00e9s \u00eatre du domaine public, on trouvait pas mal de vieux textes, ce qui donnait \u00e0 mon avis ce c\u00f4t\u00e9 sympathiquement \u00ab\u00a0vieillot\u00a0\u00bb des mots que j&rsquo;avais g\u00e9n\u00e9r\u00e9s.<\/p>\n<p>Avec le dictionnaire fran\u00e7ais de hunspell, c&rsquo;est un chouilla moins fun je trouve, mais \u00e7a marche bien quand m\u00eame.<\/p>\n<p>Voici quelques mots \u00ab\u00a0fran\u00e7ais\u00a0\u00bb fabriqu\u00e9s al\u00e9atoirement :<\/p>\n<p><em>bilectique, pulablote, cyclisale, pluminer, gr\u00e9ricodire, tacatrapt\u00e9e, bardemper, cyclisale, gulatrie, extrabler, capapente, etc. <\/em>(<a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/output\/output_FR.txt\" target=\"_blank\" rel=\"noopener\">plein d&rsquo;autres mots ici<\/a>)<em><br \/>\n<\/em><\/p>\n<p>Gr\u00e2ce \u00e0 hunspell, j&rsquo;ai pu facilement \u00e9tendre l&rsquo;analyse \u00e0 d&rsquo;autres langues comme<\/p>\n<p><strong>l&rsquo;espagnol<\/strong>,<\/p>\n<p><em>cebille\u00f1o, dizner\u00f3n, perador, vitazar, mal\u00farda, menjez, cozonar, realsargados, etc. (<a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/output\/output_ES.txt\" target=\"_blank\" rel=\"noopener\">plein d&rsquo;autres mots ici<\/a>)<br \/>\n<\/em><\/p>\n<p><strong>l&rsquo;italien,<\/strong><\/p>\n<p><em>tiviatote, oritace, ingivolosi, cappidero, stezzano, crettoble, revanazio, nenteggiando, etc.<\/em>\u00a0 (<a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/output\/output_IT.txt\" target=\"_blank\" rel=\"noopener\">plein d&rsquo;autres mots ici<\/a>)<\/p>\n<p><strong>le hongrois,<\/strong><\/p>\n<p>k\u00e9rmegy, szapolt, hisz\u00f3, ujj\u00f3bbrak, csipitkolge (<a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/output\/output_HU.txt\" target=\"_blank\" rel=\"noopener\">plein d&rsquo;autres mots ici<\/a>)<\/p>\n<p>et donc surtout&#8230;le su\u00e9dois (<a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/output\/output_SE.txt\" target=\"_blank\" rel=\"noopener\">plein d&rsquo;autres mots ici<\/a>) ! Ikea n&rsquo;a qu&rsquo;\u00e0 bien se tenir !<\/p>\n<p>Ci-dessous, vous trouverez une comparaison des matrices de transition des diff\u00e9rentes langues, et qui montre que certaines langues sont plus contrast\u00e9es que d&rsquo;autres. Comme je le disais dans mon premier billet sur le sujet, <a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/output\/output_EN.txt\" target=\"_blank\" rel=\"noopener\">les mots anglais cr\u00e9\u00e9s par la machine<\/a> sonnent relativement mal, et je pense que c&rsquo;est \u00e0 relier avec le fait que la matrice est peut-\u00eatre moins contrast\u00e9e que dans les autres langues, ce qui conduit \u00e0 produire des mots plus al\u00e9atoires et moins typ\u00e9s.<\/p>\n<p>Mais je vous mets tout en t\u00e9l\u00e9chargement pour que vous puissiez vous faire votre id\u00e9e ! (<a href=\"http:\/\/www.science-etonnante.com\/WordsMachine\/machine.py\" target=\"_blank\" rel=\"noopener\">ICI LE CODE SOURCE<\/a>)<\/p>\n<h4>Fran\u00e7ais<\/h4>\n<h4><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr.png\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-7788 lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr.png?w=600\" alt=\"matrix_FR\" width=\"600\" height=\"600\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr.png 800w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr-300x300.png 300w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr-150x150.png 150w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr-768x768.png 768w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_fr-370x370.png 370w\" data-sizes=\"(max-width: 600px) 100vw, 600px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 600px; --smush-placeholder-aspect-ratio: 600\/600;\" \/><\/a>Anglais<\/h4>\n<h4><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en.png\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-7792 lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en.png?w=600\" alt=\"matrix_EN\" width=\"600\" height=\"600\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en.png 800w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en-300x300.png 300w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en-150x150.png 150w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en-768x768.png 768w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_en-370x370.png 370w\" data-sizes=\"(max-width: 600px) 100vw, 600px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 600px; --smush-placeholder-aspect-ratio: 600\/600;\" \/><\/a>Su\u00e9dois<\/h4>\n<p><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se.png\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-7791 lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se.png?w=600\" alt=\"matrix_SE\" width=\"600\" height=\"600\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se.png 800w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se-300x300.png 300w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se-150x150.png 150w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se-768x768.png 768w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_se-370x370.png 370w\" data-sizes=\"(max-width: 600px) 100vw, 600px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 600px; --smush-placeholder-aspect-ratio: 600\/600;\" \/><\/a><\/p>\n<h4>Italien<\/h4>\n<p><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it.png\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-7790 lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it.png?w=600\" alt=\"matrix_IT\" width=\"600\" height=\"600\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it.png 800w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it-300x300.png 300w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it-150x150.png 150w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it-768x768.png 768w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_it-370x370.png 370w\" data-sizes=\"(max-width: 600px) 100vw, 600px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 600px; --smush-placeholder-aspect-ratio: 600\/600;\" \/><\/a><\/p>\n<h4>Espagnol<\/h4>\n<p><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es.png\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-7793 lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es.png?w=600\" alt=\"matrix_ES\" width=\"600\" height=\"600\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es.png 800w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es-300x300.png 300w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es-150x150.png 150w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es-768x768.png 768w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_es-370x370.png 370w\" data-sizes=\"(max-width: 600px) 100vw, 600px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 600px; --smush-placeholder-aspect-ratio: 600\/600;\" \/><\/a><\/p>\n<h4>Hongrois<\/h4>\n<p><a href=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu.png\"><img decoding=\"async\" class=\"aligncenter size-large wp-image-7789 lazyload\" data-src=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu.png?w=600\" alt=\"matrix_HU\" width=\"600\" height=\"600\" data-srcset=\"https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu.png 800w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu-300x300.png 300w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu-150x150.png 150w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu-768x768.png 768w, https:\/\/scienceetonnante.com\/blog\/wp-content\/uploads\/2015\/11\/matrix_hu-370x370.png 370w\" data-sizes=\"(max-width: 600px) 100vw, 600px\" src=\"data:image\/svg+xml;base64,PHN2ZyB3aWR0aD0iMSIgaGVpZ2h0PSIxIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPjwvc3ZnPg==\" style=\"--smush-placeholder-width: 600px; --smush-placeholder-aspect-ratio: 600\/600;\" \/><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Vous connaissez le canap\u00e9 S\u00f6ft ? La commode Utrad ? L&rsquo;\u00e9tag\u00e8re H\u00e5ng ? L&rsquo;armoire Muskydd ? Le mixeur Skymfor ? La po\u00eble Kukv\u00e4de ? Le placard Kl\u00f6stig ? Le circuit Rundering ? La table Oljulstad ? Les rideaux Lykof\u00e5tsly ? Le bureau H\u00e5kmanedfol ? La chaise Sj\u00e4rganskig ? Eh bien contrairement aux apparences, ces noms<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[4],"tags":[69,48],"class_list":{"0":"post-7786","1":"post","2":"type-post","3":"status-publish","4":"format-standard","6":"category-mathematiques","7":"tag-langage","8":"tag-probabilites"},"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"post_mailing_queue_ids":[],"_links":{"self":[{"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/posts\/7786","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/comments?post=7786"}],"version-history":[{"count":0,"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/posts\/7786\/revisions"}],"wp:attachment":[{"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/media?parent=7786"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/categories?post=7786"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scienceetonnante.com\/blog\/wp-json\/wp\/v2\/tags?post=7786"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}