Wednesday, May 6, 2020

Using Technology for Corpus Studies

Question: Write about theUsing Technology for Corpus Studies. Answer: English Language and Linguistics The data chosen is the one on American Soap Opera. The tool of analysis will be the use of word clouds. The technological application in this analysis is found to be vital so as to get the best outcome from the data and in so doing, realize the best combinations. American Soap is a good data set as it contains huge volumes of data to be used. The fact that it can be analyzed by all other tools helps us see the disadvantage or advantage of using a tool on a particular data set. Word cloud is a graph of word frequency in a particular given text. They are easy to generate. They work with the sense that the most frequent word appears largest and the size of other words decreases with relative decrease in frequency. In this view American soaps would be interesting as they are practical. Word clouds are appealing to the eye and thus have slim chances for human error (Lee, Riche, Karlson Carpendale, 2010) In the analysis of the American soap operas using word cloud of all soaps combined the most frequent words appear larger than others. This would therefore give a summative overview of ideas that are uniformly addressed by all American soap operas. However, in this analysis another approach to be taken will be to analyze each soap opera individually by generating a word cloud for each. This would bring an overview of the thematic concerns addressed in each soap opera. The best-selling soap would then be identified and an educated guess of favorite themes met. Furthermore, analysis will be quick as the generation does not require a lot of time as compared to other tools. It will also be fast since the human eye will quickly note the prominent texts. (Carmel, Uziel, Guy, Mass Roitman, 2012) References Lee, B., Riche, N. H., Karlson, A. K., Carpendale, S. (2010). Sparkclouds: Visualizing trends in tag clouds. IEEE transactions on visualization and computer graphics, 16(6), 1182-1189. Carmel, D., Uziel, E., Guy, I., Mass, Y., Roitman, H. (2012). Folksonomy-based term extraction for word cloud generation. ACM Transactions on Intelligent Systems and Technology (TIST), 3(4), 60.

No comments:

Post a Comment