Sep'18

The IUP Journal of Information Technology

Focus

In this era of Information Technology, data is virtually everywhere whether its existence is known to someone or not.

Organizational processes generate huge amounts of data. The usability of this data depends on innovative and useful applications conceived by an organization. Data is perceived as a commodity in today's world. It has enormous value which is to be discovered within the realm of the business. Systematic approaches to capture and analyze this data help organizations gain very deep insights into the behavior of their customers and markets. Data, with proper handling, can improve the operational efficiency of an organization and has potential to radically change the business landscape.

Given the above proposition, data analytics has taken the center stage in many organizations of late. The rise of data analytics gave birth to a new field of study. Several tools and techniques, both old and new, are available to deal with data analytics or data science. Nevertheless, ample research opportunities exist in order to deal with challenges of real-time data processing, data storage and transport, predictive analytics as well as effective data visualization.

The first paper, "Reflecting Design Considerations: An End-to-End Case Study on Preparing Cricket Data Available on Net Analysis Ready", by Subhasis Ray and Kalyan Sengupta, presents an approach to trap the potential of the huge amounts of data available on various webpages on the Internet. Following a case study approach, the authors have demonstrated that the available data, which is often unusable in its inherent form, can be made highly valuable after achieving a good amount of transformation, aggregation and summarization.

Indexing techniques play a very crucial role in information retrieval domain. Most of the queries against large heterogeneous distributed databases and data warehouse systems are complex and often iterative. Right index structures are essential to improve the performance of these queries, especially the ad hock ones. The next paper, "Parsing Set of Queries to Obtain Parse Matrix for Table Index Evaluator and Recommender", by Shefali Naik, proposes an approach for table index evaluation and recommendation. According to the author, the implemented system is effective and useful in handling indexing in databases.

Sentiment analysis is a well-known text analysis approach to know the tone of a piece of text such as tweets, comments and feedbacks. The applications of sentiment analysis in business have a lot of importance, particularly in brand management. The last paper, "A Survey of Different Opinion Mining Methods", by Shubhashree Acharya and Navjyotsinh Jadeja, presents various techniques used in sentiment analysis considering different application domains. The authors also provide a pointer for further investigation on application of sentiment analysis in the field of education in order to improve the quality of educational institutions and their delivery.

-A C Ojha
Consulting Editor

Article   Price (₹)
Reflecting Design Considerations: An End-to-End Case Study on Preparing Cricket Data Available on Net Analysis Ready
100
Parsing Set of Queries to Obtain Parse Matrix for Table Index Evaluator and Recommender
100
A Survey of Different Opinion Mining Methods
100
Articles

Reflecting Design Considerations: An End-to-End Case Study on Preparing Cricket Data Available on Net Analysis Ready
Subhasis Ray and Kalyan Sengupta

The use of Internet as a source of secondary data is becoming more popular day by day. Websites are made up of webpages that contain a huge volume of useful information in textual form. However, webpages are coded using text-based mark-up languages (e.g., HTML, XHTML, XML, etc.) to facilitate end-user viewing rather than any automated use of them. This has led to a new science called web scraping that fetches webpages and then extracts data for future use. Many organizations have picked up this business opportunity to come up with efficient web scraping tools. The paper exposes the readers to how data can be sourced from the internet for scientific or commercial purpose. This elaborates on the available design options for data fetching, extracting, validating and transforming in the absence of any end-to-end tool or to supplement a tool. This is followed up by a specific case study which deals with reactive analysis of structured data from multiple predetermined sources/pages. This paper concludes that design considerations for web scraping have to be dynamic. Neither traditional copy-and-paste nor trapping feeds using Application Programming Interfaces (API) nor Java, Python or R programming nor the end-to-end tool available is uniformly better than the rest.


© 2018 IUP. All Rights Reserved.

Article Price : Rs.100

Parsing Set of Queries to Obtain Parse Matrix for Table Index Evaluator and Recommender
Shefali Naik

The use of appropriate indexing improves the performance of transactions in heterogeneous distributed database, whereas inappropriate or no indexing deteriorates the same. Properly designed index leads to faster data access, which ultimately improves the execution of transactions. Various Relational Database Management Systems (RDBMS) and third-party tools exist, which provide suggestion for index management, but up to certain limits. These tools provide index suggestion with limited and simple queries. They do not analyze or suggest index for aggregate queries, sub-queries and other complicated queries. The applications which access data from heterogeneous databases need an index evaluator and recommender to analyze and recommend indexes for tables. For this type of multiple heterogeneous distributed databases, Table Index Evaluator and Recommender (TIER) is proposed which takes set of queries as inputs. Queries in the set are based on local and remote tables (tables which are distributed on various RDBMS). In order to recommend indexes, the fields which are mentioned in WHERE and HAVING clauses of inputted set of queries should be parsed. Besides this, the total frequency of each field table-wise and overall is required. The parsed fields with frequency result in Parse Matrix (PM) and the obtained PM is used by TIER for further processing. In this paper, the algorithm to obtain Clause Matrix (CM) and PM is described.


© 2018 IUP. All Rights Reserved.

Article Price : Rs.100

A Survey of Different Opinion Mining Methods
Shubhashree Acharya and Navjyotsinh Jadeja

The paper focuses on the utilization of sentiment analysis techniques in various application domains and presents the major part of the work done in the field of sentiment mining or opinion mining using the techniques and tools of sentiment analysis. The paper gives a brief idea regarding the comparison of the techniques and the importance of the dataset in acquiring the desired outcomes.


© 2018 IUP. All Rights Reserved.

Article Price : Rs.100