Đang tải... (xem toàn văn)
Definition Business Intelligence BI refers to the technology-driven process of collecting, analyzing, and presenting business data to support decision-making within an organization.. Key
Trang 1ASSIGNMENT 2 FRONT SHEET Qualification BTEC Level 5 HND Diploma in Computing
Unit number and title Unit 14: Business Intelligence
Trang 3Summative Feedback: Resubmission Feedback:
Lecturer Signature:
Trang 52 Evaluate how this tool help 2
II Boxplot chart 2
1 Analyze 2
2 Evaluate how this tool help 2
III Bar Chart 2
1 Analyze 2
Trang 62 Evaluate how this tool help 3
IV Correlation Matrix 33
1 Analyze 3
2 Evaluate how this tool help 3
D Explore the legal issues involved in the secure exploitation of business intelligence tools (P6) 34
I Legal issues 3
1 Data Privacy 3
2 Intellectual Property 3
3 Contractual Obligations 3
Trang 7II Security issues 3
1 Data handling and storage 3
2 Cybersecurity Threats 3
3 Bias, Fairness & Discrimination 35
III Business Intelligence Tools 36
Figure 2 Histogram about the adoption of Machine Learning in the enterprise
Figure 3 Bar diagram shows the number of language users based on level
Figure 4 Boxplot diagram for median salary of multiple work position 4
Figure 5 Business Process Diagram
Figure 6 Business Intelligence with Python
Figure 7 Python code in Spyder
Figure 8 Data Visualization by Python 1
Figure 9 Tableau
Figure 10 Tableau Desktop
Figure 11 Tableau visualization 1
Figure 12 Python visualization 1
Trang 8Figure 13 Libraries needed
Figure 14 Filter data
Figure 15 Change Column data
Figure 16 Before change
Figure 17 After change
Figure 18 New columns
Figure 19 Code for histogram
Figure 20 Histogram in Python 1
Figure 21 Code for boxplot 1
Trang 9Figure 22 Boxplot in Python 1
Figure 23 Code for correlation matrix 1
Figure 24 Correlation Matrix in Python 1
Figure 25 Code for bar chart
Figure 26 Barchart in Python - C user
Figure 27 Barchart in Python R user –
Figure 28 Barchart in Python SQL user – 2
Figure 29 Barchart in Python - Python user 2
Figure 30 Save into new file 2
Figure 31 Histogram in Tableau 2
Figure 32 Boxplot in Tableau 2
Figure 33 Barchart in Tableau 2
Figure 34 Histogram made by using Python (Spyder) 2
Figure 35 Histogram made by using Tableau
Figure 36 Boxplot chart made by using Tableau 2
Figure 37 Boxplot chart made by using Python (Spyder) 2
Figure 38 Barchart in Tableau 2
Figure 39 Barchart in Python - C user
Figure 40 Barchart in Python - R user
Figure 41 Barchart in Python - SQL user 3
Figure 42 Barchart in Python - Python user 3
Figure 43 Correlation Matrix made by using Python (Spyder) 3
Trang 11A Determine, with examples, what business intelligence is and the tools and techniques associated with it (P3)
1 Definition
Business Intelligence (BI) refers to the technology-driven process of collecting, analyzing, and presenting business data to support decision-making within an organization The primary goal of BI is to help organizations gain a better understanding of their internal and external data in order to make informed business decisions
Figure 1 Business Inteligence
There are many definitions and interpretations of BI However, the most practical one is that BI is a health management profile, storing information for regular check-ups Then, we will know our health in the future, what diseases we might have, and where our health will be weak,
Trang 12, g , ,and it will notify us of that Businesses undergo a "health check-up." BI will reveal what the business needs to ensure its health That is BI in the most practical sense
Key activities in the field of Business Intelligence includ e:
Data Collection: Organizations gather data from various sources, including internal databases, external market data, customer data, and competitive intelligence
Data Processing: The collected data often needs to be processed and transformed into a suitable format for analysis This may involve cleaning, aggregating, and structuring the data
Data Analysis: BI involves using various analytical tools to explore data, discover trends, and gain insights into the relationships between different factors affecting the business
Trang 13Reporting and Visualization: BI tools provide capabilities for creating and customizing reports, dashboards, and visualizations These help managers and decision-makers understand the information and make decisions based on data
Predictive Analytics and Data Mining: Some BI systems incorporate predictive analytics and data mining techniques to forecast future trends and support strategic decision-making BI enables organizations to enhance their flexibility and responsiveness in decision-making, mitigate risks, and optimize business performance BI tools are often integrated with other enterprise systems to create a comprehensive information system
2 Reason why I choose this business model
Trang 14Figure 2 Histogram about the adoption of Machine Learning in the enterprise
The histogram illustrates the uptake of Machine Learning in distinct sectors It indicates that the Computer/Technology industry boasts the highest level of adoption, followed by the Academics/Education sector Evidently, these two sectors hold notably larger shares of the market compared to others, suggesting substantial potential and ample opportunities Consequently, our business will primarily target these sectors Given the rapid evolution of technology, engagement in the computer/technology industry promises numerous possibilities and innovation Furthermore, this sector represents the largest market, nearly double the size of the second-largest
Trang 15market With its continuously expanding potential, this industry appears to be the most advantageous space for our company's involvement.
Our business will concentrate on devising projects to address challenges and aid enterprises in diverse capacities, including A.I training, developing support software, and meeting the specific needs of businesses within the computer/technology realm Consequently, meticulous attention must be paid to the screening of human resources to ensure that we assemble the most suitableteam for these endeavors
Trang 16Figure 3 Bar diagram shows the number of language users based on level
Trang 17It can be seen from the chart above that a prevalence of master's level users across most languagesfollowed by bachelor's and doctorate-level users This suggests an opportunity to target a talent pool that is both abundant and capable of attaining a master's degree, making it an appealing segment for recruitment Across all languages surveyed, master's degree holders consistently represent the largescohort
Moving on to the number of users for each language, focusing specifically on the master's level, it is apparent that Python, R, and SQL stand out as the most abundant and dominant languages, surpassinothers by a substantial margin Consequently, these languages will serve as the primary focus for our company's project The considerable user base for these languages translates to a large potential talenpool, simplifying the recruitment process and enhancing negotiating capabilities Python will serve as our primary language for process development with over three thousand users, followed by R with over two thousand users, and SQL with approximately one thousand users
Figure 4 Boxplot diagram for median salary of multiple work position
Through the boxplot chart above, we can see that job groups directly related to data have quite high average salaries This is understandable because their job is quite heavy and must
Trang 18q g g j q y
provide an accurate and quick view to be able to grasp the market quickly as well as leapfrog market trends Meanwhile, jobs like programming or statistics have salaries that are not too high but not too low either That's because their job is to determine and ensure the process As for analysts, their work is multi-faceted It is impossible to know in advance Today may be as predicted, but the following days are uncertain Therefore, the more you spend on good analysts, the more useful and accurate information you will receive in return That's the reason why their average salary is so high because everyone wants to find a talented analyst to be able to grasp themarket accurately and quickly
Having an overview of the market, market requirements and grasping trends to be able to stay ahead of the market will bring a huge advantage to the company when we can be one of the first units to do what the market needs That way, we will eliminate a large number of direct competitors in the industry because the bigger the pie, the more people will want a piece
Trang 193 Business Process
Trang 20Figure 5 Business Process Diagram
Trang 21 Market research and analysis
To ensure the stability of our business operations, we will commence with comprehensive market research and analysis, enabling us to approach the market methodically and insightfully This rigorous market research will serve as a foundational step, providing the company with a clear direction for future projects
Our initial focus will be on conducting market research and identifying industries in which we can develop solutions This will pave the way for our company's entry into the market Following the planning phase, the next crucial step involves selecting the most suitable programming language for the project This is vital, as the language being the most commonly used may not necessarily align with the specific requirements of the solution Subsequently, we will proceed to recruit the necessary personnel for the project
The budget is undeniably a critical factor Without adequate funding, operational progress is impeded It is imperative to carefully calculate expenses to determine the budget and forecast costs for the entire plan If the budget falls short, it will hinder the project's execution Conversely, with sufficient funding, we can proceed to the final stage of this phase, which involves recruiting the necessary personnel for the project and initiating its implementation
Solution Design and Development
In this stage, ideas are generated, concepts are formulated, and the foundational framework of the software solution is delineated This phase is pivotal as developers define the problem they aim to address and conceive an effective software solution Subsequently, the "Determine scope, features,functionalities, legality" process follows the design phase This stage involves evaluating the software's functionality, anticipated features, and scope, as well as assessing the legal aspects ofsoftware development, encompassing issues related to intellectual property rights and compliance with data protection laws.
A crucial aspect that follows is the assessment of the chosen programming or development language to determine its appropriateness for realizing the envisioned design Selecting the right language is a critical step in the software development process, as it significantly impacts the program's effectiveness, performance, and usability If the chosen language is deemed unsuitable, the feasibility of modifying the project or language is examined This entails assessing whether adjustments can be made to align the design and the chosen language If modifications are feasible, the process loops backto the design stage, signaling the need for a reassessment and redesign of the solution.
Trang 22to the design stage, signaling the need for a reassessment and redesign of the solution
However, if adjustments are not feasible, the process regresses to the language research stage toidentify an alternative language that aligns with the plan The process then recommences from the beginning until the desired outcome is achieved
Upon confirmation of the suitability of the chosen language, the process advances to the final stage, which entails creating a prototype In this phase, a prototype or preliminary version of the software is developed, enabling developers to assess its functionality and make necessary adjustmentsbefore final deployment
Prototyping
Trang 23The initial step in this phase involves developing prototypes, which are early versions of the product These prototypes serve as a tangible representation of the product and allow developers toevaluate its functionality
Subsequently, we progress to the testing and feedback stage to validate the prototypes Here, extensive testing is conducted on the prototypes, and stakeholders or users provide their feedback toidentify any weaknesses or potential areas for improvement
Following the validation process, an assessment is made to determine if the prototypes meet our expectations and adhere to the specified requirements If the response is negative, the process revertto the initial phase, signifying the need for further refinement and development of the prototypes
If the prototypes successfully fulfill the requirements, the process advances to the phase of refining and enhancing the solution Here, the solution is adjusted and improved based on feedback and testresults from the validation phase
A crucial inquiry that arises is whether additional prototyping is necessary This evaluation determines if further prototyping is essential If the answer is "Yes", the process returns to the initial step for further development of prototypes If the answer is "No", the process proceeds to the final step, "Proceed to development", which marks the transition from prototyping to the actual development of the product.
Further Development and Deployment
The process commences with the commencement of full-scale development During this phase, the product is concretely developed with the prototypes and design serving as a guide Developers translate the design into a functional product in this crucial stage
The subsequent step entails deploying the solution on suitable infrastructure This phase involves establishing the required infrastructure to support the solution and deploying the developed product onto this infrastructure
Following the deployment phase, integration testing and quality assurance activities are undertaken This marks the outset of the product's functional and performance testing phase Verifying that the product aligns with predetermined quality standards and requirements falls within the scope of quality assurance
The subsequent phase involves assessing the product's quality This step evaluates if the product
Trang 24meets the defined quality standards If the response is negative, it indicates the necessity to addresserrors and adjust the product, signaling a requirement for further refinement and bug fixing Upon successful verification of the quality, the process proceeds to the final stage, which is the product launch This stage signifies the official introduction of the product to the market
Product launch and Maintain
This phase marks the culmination of the process Its primary task is to develop and implement marketing strategies for our product This entails formulating a comprehensive marketing strategy, defining the target audience, understanding their needs, and positioning the product to address thoseneeds
Trang 25The subsequent action step focuses on product launch and customer acquisition Introducing the product to the market and acquiring clients are the primary objectives of this phase, involving activities such as sales tactics, advertising campaigns, and promotional events
Following the product launch, the subsequent phase emphasizes customer support, feedback collection, and survey administration This stage underscores the significance of feedback and customer service in the development of new products Conducting surveys, gathering feedback, and providing customer support are integral components of this phase to gain insight into customers' experiences with the product
The next step in the progression evaluates whether the feedback indicates high user satisfaction with the product If the response is negative, the process moves to the stage of updating and maintaining solutions, denoting the need for further refinement and improvement of the product
Conversely, if the feedback indicates high customer satisfaction, the process advances to the final step, prioritizing product enhancements and continuous development This stage underscores the importance of ongoing development, even in the presence of significant user satisfaction It involves refining the product's features and making adjustments in response to evolving consumer demands andindustry trends
1 Python
Python is a versatile and powerful programming language widely used in various fields, including data analysis, artificial intelligence, web development, and many other applications It has simple, readable syntax, making it easy for beginners to quickly learn and master
Trang 26Figure 6 Business Intelligence with Python
In the realm of Business Intelligence (BI), Python plays a crucial role in several aspects: Data Analysis: Python provides powerful libraries such as Pandas, NumPy, and SciPy,
facilitating efficient data analysis and processing
Trang 27Data Visualization: Libraries like Matplotlib and Seaborn help create charts and graphs to visually represent data in an understandable manner
Data Connection and Retrieval: Python has libraries like SQLAlchemy and PyODBC, enabling connection and retrieval of data from various sources, including databases, Excel files, and APIs
Machine Learning and Prediction: Python is a powerful tool in the fields of artificial intelligence and machine learning Libraries like Scikit-learn and TensorFlow support the building of prediction and classification models
Automation and Automated Reporting: Python can be used to automate repetitive tasks in BI processes and generate automated reports
Web Development and Integration: Python has frameworks like Flask and Django, aiding in building web applications and integrating BI into organizational systems
Open Source and Large Community: Python is an open-source language with a large programming community This means abundant resources, libraries, and community support
In our case, we have applied Python for data analysis and data visualization through the Spyder software, which integrates several libraries such as Pandas and NumPy for numerical analysis Additionally, it is further supported by the Matplotlib and Seaborn libraries (installed externally, or integrated if using Anaconda with Spyder) Python proves to be quite flexible in data processing; however, for visualization, it may not stand out as much Therefore, we use Tableau to enhance the visual aesthetics compared to Python
Trang 28Figure 7 Python code in Spyder
Trang 29Figure 8 Data Visualization by Python
2 Tableau
Tableau is a powerful data visualization and business intelligence (BI) platform that allows organizations to turn raw data into meaningful insights It provides interactive and shareable dashboards, reports, and charts, making it easier for businesses to understand complex datasetsand make data-driven decisions
Trang 30Figure 9 Tableau
Tableau, with its robust set of features, stands out as a leading data visualization and business intelligence platform Its intuitive data visualization capabilities empower users to craft visually stunning and interactive dashboards effortlessly, eliminating the need for extensive coding or technical expertise The platform's strength lies in its connectivity, offering seamless integration with a diverse range of data sources, including databases, spreadsheets, and cloud-based repositories Tableau's real-time data analysis feature enables users to gain quick insights into dynamic trends, while the drag-and-drop functionality simplifies the creation of charts, graphs, and maps The ability to blend data from various sources ensures a comprehensive
Trang 39Figure 17 After change
Figure 18 New columns
Our initial step involves creating a histogram to visualize the distribution of AI/machine learning adoption across different industries To achieve this, we will tally the occurrences of industries in the dataset, specifically in column Q24, representing the current industry of the respondents Seaborn will be employed to generate the histogram based on the values in column Q24 Subsequently, we will utilize matplotlib to modify the x-axis label, ensuring it provides more context than just "Q24." Additionally, we will rotate the names of the industries 90 degrees for
Trang 40improved readability
Figure 19 Code for histogram