{"id":122296,"date":"2018-11-18T16:13:31","date_gmt":"2018-11-18T10:43:31","guid":{"rendered":"https:\/\/staging-imarticus.kinsta.cloud\/?p=122296"},"modified":"2022-10-13T05:41:18","modified_gmt":"2022-10-13T05:41:18","slug":"how-machine-learning-is-saving-the-indian-vernacular","status":"publish","type":"post","link":"https:\/\/imarticus.org\/blog\/how-machine-learning-is-saving-the-indian-vernacular\/","title":{"rendered":"How Machine Learning Is Saving The Indian Vernacular ?"},"content":{"rendered":"<p><span style=\"font-weight: 400;\">In a nation riddled with countless cultures, unending dialects and infinite separations, the term \u2018melting pot&#8217; comes to mind. It&#8217;s common for the typical Indian being confused with the local tongues when treading into unfamiliar territories. <\/span><br \/>\n<span style=\"font-weight: 400;\">Fortunately for the millions of Indians beguiled by such problems, <\/span><strong><a href=\"https:\/\/imarticus.org\/postgraduate-program-in-data-science-analytics\/\">machine learning courses<\/a><\/strong><span style=\"font-weight: 400;\"> and a number of <\/span><span style=\"font-weight: 400;\">data science tools<\/span><span style=\"font-weight: 400;\"> is proving to be a much-needed relief for preserving and keeping those languages intact.<\/span><\/p>\n<h2><b>Connecting Data To Language<\/b><\/h2>\n<h3><strong>Big Data<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">This has significantly boosted the outlook for interdisciplinary research that has allowed researchers across the country to link the aspects of linguistics and fragment all dialects to a condensed format that can be edited easily.Until now, several companies have taken to using an aggregator system to create a platform that translates the language into any other without sacrificing minor details. <\/span><span style=\"font-weight: 400;\">Several years ago, a research project under the name Technology Development for Indian Language was created by the government to scrape all the major Indian languages for <\/span><span style=\"font-weight: 400;\">data science purposes<\/span><span style=\"font-weight: 400;\">. <\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">One such platform that has been making strides is the e-Bhasha platform that is making content available for citizens in their language. It was created as a <\/span><span style=\"font-weight: 400;\">big data<\/span><span style=\"font-weight: 400;\"> project in 2015 and has become a starting point for many linguistic researchers.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">As the number of internet users in India grew more than 28 per cent and is expected to be a $6.2 billion industry per year, international groups are jumping on the bandwagon to appeal to the common man.<\/span><\/li>\n<\/ul>\n<h2><b>Playing With The Locals<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">Seeing the enormous benefits of tapping into local consumers, big groups like Google set out to create the Google Brain which is essentially an extensive neural network to develop human language from the get-go. <\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Aspects of this have been incorporated into Google Assistant as well, having translated content from more than 500 million monthly users and 140 billion words per day in as many 158 languages.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">The craze began by the year 2013 when e-commerce was still taking root in the country and was challenged by the numerous languages that consumers had in the country.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Websites like Flipkart and Snapdeal dealt with local language content for mobile websites as far back as 2015.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Reports suggest that Marathi, Gujarati, Tamil, Punjabi and Malayalam represented over 75 per cent of searches on Google in the very same languages. What&#8217;s even more interesting is that more than 73% of people surveyed are willing to go completely digital if the system communicates in their own language. \u00a0\u00a0<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Facebook has raised the number of Indian languages for posting to almost 12 but still lacks regional pages that use the same kind.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Small firms in India are collecting as much textual Corpus for languages available using translation services like Reverie, Process9 and IndusOS. \u00a0<\/span><\/li>\n<\/ul>\n<h2><b>The Technology Used <\/b><\/h2>\n<ul>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Most companies would confess to the use of neural networks for developing such programs, but the primary machines behind such global endeavors has been some rather sophisticated algorithms.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">The newest additions to the industry happen to be some enhanced versions of the <\/span><span style=\"font-weight: 400;\">Hadoop<\/span><span style=\"font-weight: 400;\"> MapReduce extension. A significant feature of the software is the ability to find linguistic linkers between similar words and compound phrases which makes translations more concrete. Some stellar packaged additions to the SPSS Modeler system too have taken place that is helping companies handle large corpuses.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">At the same time, marketing groups are using modified techniques to feed invoice data collected from average consumers which are being sent into what\u2019s being called a \u2018global corpus data set.\u2019<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">Likewise, teams across the country in data collection firms are hiring data collection engineers to converse and accumulate conversational audio recordings both in rural and urban areas.<\/span><\/li>\n<li style=\"font-weight: 400;\"><span style=\"font-weight: 400;\">The main subject remains heavily invested in cross-directional neural networks many of which are using data analysis tools and <strong>machine learning tools<\/strong> like Tensor Flow from Google and IBM Watson.<\/span><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>In a nation riddled with countless cultures, unending dialects and infinite separations, the term \u2018melting pot&#8217; comes to mind. It&#8217;s common for the typical Indian being confused with the local tongues when treading into unfamiliar territories. Fortunately for the millions of Indians beguiled by such problems, machine learning courses and a number of data science [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_mo_disable_npp":"","_lmt_disableupdate":"no","_lmt_disable":"","footnotes":""},"categories":[23],"tags":[780,804,845,859,860],"class_list":["post-122296","post","type-post","status-publish","format-standard","hentry","category-analytics","tag-machine-learning-courses-online","tag-machine-learning-courses","tag-big-data-hadoop-training-courses","tag-machine-learning-courses-in-india","tag-machine-learning-toos"],"acf":[],"aioseo_notices":[],"modified_by":"Imarticus Learning","_links":{"self":[{"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/posts\/122296","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/comments?post=122296"}],"version-history":[{"count":0,"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/posts\/122296\/revisions"}],"wp:attachment":[{"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/media?parent=122296"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/categories?post=122296"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/imarticus.org\/blog\/wp-json\/wp\/v2\/tags?post=122296"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}