Big Data Analytics

City: Mississauga
Two-Year with Co-op Ontario College Graduate Certificate
Sep Closed


As a big data analyst you will collect, analyze, and interpret large and complex sets of data using various statistical and analytical tools, in order to identify patterns, trends, and insights that can be used to inform business decisions and strategies.

Developing new ways to interpret large amounts of information collected through web sites, transactions, records, and images can help provide solutions to many business, social, society, and technological challenges of today. Big data enables users to make informed decisions and better predict future outcomes. 

The Lambton College Big Data Analytics, Ontario College Graduate Certificate is focused on utilizing big data technology for unstructured data to help guide executive, management, and industry decision-making.

The innovative curriculum will focus on topics such as information administration, development, project management, and business communications. In combination with Business Intelligence tools, data analytics, and cloud computing, students will blend theoretical knowledge with hands-on skills to learn how to capture, collect, curate, search, analyze and store complex data sets that are utilized by organizations to guide business decisions. The final semester will enhance all learning outcomes through a co-operative education work term or applied project.

In addition to a full range of enhanced virtualization deployment skills, data analytics, cloud computing theory, project management and business communications students will apply these skills to a variety of cutting-edge open-source and vendor-specific virtualization solutions. 

Mississauga - A Great Place to Study

Hear from our staff and students about why our Mississauga campus gives you a great, well-rounded education in a bustling city centre.

Achieve your goals while immersing yourself in Canadian culture.

This Lambton College program is licensed to and delivered by Queen's College (PDF), a licensed private career college in Mississauga, Ontario. Students who are registered at are students of a public college and as such, will receive full credit from Lambton College for all Lambton College courses completed at the Queen's College campus in Mississauga. Students who meet program graduation requirements will graduate with a credential from Lambton College. Students may be scheduled to have classes on Saturdays.

Admission Requirements

  • A university degree in computers, mathematics, engineering, statistics, or a related discipline

The admissions process is competitive and meeting the minimum academic requirements does not guarantee admission.

Lambton College reserves the right to alter information including admission requirements and to cancel a program or course at any time; to change the program curriculum as necessary to meet current competencies or changes in the job market; to change the pathways to third-party certification bodies; or to withdraw an offer of admission both prior to and after its acceptance by an applicant or student because of insufficient applications or registrations or over-acceptance of offers of admission. In the event Lambton College exercises such a right, Lambton College's sole liability will be the return of monies paid by the applicant or student to Lambton College.

English Language Requirements

  • IELTS of 6.5
    IELTS of 6.0 + Completion of EAP-3106 (English for Academic Purposes) during the first term of study

- or -

  • TOEFL iBT 79
    ITOEFL 70 + Completion of EAP-3106 (English for Academic Purposes) during the first term of study

- or -

  • Passed Lambton Institute of English placement test

Please Note: IELTS is the only proficiency score accepted by the Study Direct Stream (SDS) program. Additional country-specific requirements may also be applicable.

Meeting the minimum English requirements does not guarantee admission. Students with higher English proficiency scores will receive priority in the admission assessment process. Not all students will qualify for EAP-3106 in place of the required IELTS or TOEFL test scores.
An image of two students stying in the common area with book shelves behind them.
A photo of the iconic Abosolute World towers in Mississauga with clear sky.
International students sitting in the lunch room area with cafeteria in background.


  • Term 1 $9,200.00
  • Term 2 $8,500.00
  • Term 3 $9,200.00
  • Co-op Term $0.00
Total Cost of Program

Tuition fees are estimates and are subject to change each academic year. Fees do not include books (unless specifically noted), supplies or living costs.

Lambton College reserves the right to alter information including admission requirements and to cancel at any time a program or course; to change the location and/or term in which a program or course is offered; to change the program curriculum as necessary to meet current competencies or changes in the job market; to change the pathways third-party certification bodies; or to withdraw an offer of admission both prior to and after its acceptance by an applicant or student because of insufficient applications or registrations or over-acceptance of offers of admission. In the event Lambton College exercises such a right, Lambton College’s sole liability will be the return of monies paid by the applicant or student to Lambton College.

Additional Fees

WIL Project Fees

Students who are not successful in securing a co-op or fail to meet the co-op requirements will need to register in CPL-5559 WIL Project.

There is an additional fee of $2,480 for each student enrolled in the WIL Project course.


The anticipated cost for textbooks in this program is approximately $500 - $700 per term. This amount accounts for both mandatory textbook costs (included in tuition fees) as well as textbook fees not included in your tuition fee amount.

Important Dates, Deadline & Late Fees

For additional information on registration dates, deadlines and late fees please refer to Registration Dates and Deadlines.

Student Fees

A student services fee is included in your tuition.

Health Insurance Coverage

Emergency medical insurance is mandatory for all international students at Lambton College. This includes students who are full-time and part-time and who are on a co-op. This insurance is provided by - a third party insurance provider.

See Insurance Costs & Details


Technology Requirements

In order to keep pace with the requirements of each and every course in your program, Lambton College requires that each student have access to a laptop while studying at our college.


Big Data Tools

This course overviews different big data tools such as Hadoop,Spark, HBase, Phoenix, Cassandra, MongoDB, Sqoop, Flume, Kafka, Oozie, etc. Studentswill have a detailedunderstanding of how various big data tools are used to solve different businessproblems. They are introduced to analyzing, parsing, splitting, modifying, and identifying key correlations between large and diverse data sets. Students then apply methods and tools to efficiently query, parse, and display raw data sets. They programmatically extractdata from varioussources and file formats. Pythonand the Pandasenvironment are utilized for the development.

Data Encoding Principles & Collection Methods

This course provides hands-on experience in data management and data encoding. Students solve real-world problems to examine the key issues impacting the data management function and costs. They discuss the issues impacting legal, regulatory, privacy, recordkeeping, information accessibility, knowledge management, governance, quality, and accountability for data storage and data repositories. Students will compare and apply technologies, standards, and approaches used to encode, encrypt, secure, migrate, and synchronize information between different systems and file formats. Using Python, they will implement encoding and encryption techniques such as one-hot encoding, binary encoding, AES encryption, DES encryption, etc.

Data Technology Solutions

This course introduces the core concepts of Big Data Technology Solutions with hands-on practices in business process modelling, data analysis, and data modelling. Students install, configure, administer, and optimize a Hadoop ecosystem. They overview technologies such as SAS, R, Python,Gephi, Tableau, Simba,AWS Lambda, Hadoop, Spark, Azure, and SAP. Students will study build-versus-buy considerations, application life-cycle management, design patterns, etc. They will be able to choose specialized technology solutions to support deep data analytics and optimize the big data ecosystem by reducing data movement and applying performance-tuning measures to a Hadoop parallel-processing environment.

Big Data Fundamentals

This course provides a fundamental overview of big data and existing big data frameworks. Students use a data-driven approach to leverage the features of big data to improve a company's business components, such as business processes, operational models, and business models, to handle tasks from sales to marketing, operations, supply chain, human resources, and finances. They understand how big data is used in different industries, for example, healthcare, financial institutions, pharmaceutical companies, manufacturing, and retail. Students will understand the impact of key big data features variety, velocity, and volume by analyzing the best practices for big data and corporate data governance. Students will also explore different roles, the use of artificial intelligence and robotics, security and privacy policy, and informatics in a big data environment. Students will have hands-on expertise in Hadoop, MapReduce, Yarn, Apache Hive, and Pig.

Introduction to Programming for Big Data

This course teaches the essential concepts of everyday computer programming used to solve real-world problems. Students use the fundamental data representation and processing constructs required to solve various problems. They will develop algorithm-based solutions to data processing problems using workflow ideas such as sequence, loop, and selections. The course uses methodical techniques to train the students to write programs that solve well-specified problems. Moreover, the course introduces selected basic problem-solving approaches, such as decomposing data processing problems into numerous tasks whose functions are demonstrated within a specified workflow. The focus is given to the skillset development for introductory programming mastery, with a substantial concentration on the basic building blocks of computer programs and the associated ideas and principles. At first, a procedural context is taught to understand the requirements for sequential processing and control flow statements. Next, students are introduced to Object Oriented Programming and learn classes, objects, and different features of Object-Oriented Programming. The labs and assignments are based on Python and ensure the development of the required competencies for the development of program solutions to problems of sufficient complexity and relevancy.

NoSQL Database

This course will provide the students with the core concepts of NoSQL databases. It explores the four types of NoSQL databases, i.e., Document-oriented, Key-Value Pair based, Column-oriented, and Graph-based databases. Students will explore different ways to search, create, and analyze data using the MongoDB NoSQL database. They will have hands-on experience of learning MongoDB design basics, including navigating, computing, and querying the database by solving practical problems that Canadian businesses and industries have.

Application Design for Big Data

In this course, students use statistical models such as linear, nonlinear, Naive Bayes, decision trees, deep learning, etc., to solve real-life business problems. They will use various algorithms, data structures, and applications to handle different file formats,Big Data, etc.,for full life-cycle data-driven development. Students will have a clear understanding of how performance estimation is done. They solve real-life problems using different methods such as indexing, parallel processing, waiting-line-queuing, time series analysis, and discrete-event simulation. MongoDB, CouchDB, HBase, Hive, Pig, Sqoop, ZooKeeper, Maven and SBT are used to develop Big Data models. Students also use a high-performance machine learning framework: MAHOUT.

Introduction to Artificial Intelligence

This course presents the basics of artificial intelligence (AI) by exploring AI's core concepts, related fields, history, and practical applications. Students will solve real-life problems by implementing various AI algorithms. They will compare the AI approaches to optimize the problem-solving process utilizing large data sets, parallel processing, swarm intelligence, knowledge representation, and manipulation. The laboratory portion provides students with hands-on practices and the opportunity to apply their knowledge to real-life AI challenges using supervised classification techniques based on artificial neural networks, regression, unsupervised learning (clustering), etc. They will use PyTorch and TensorFlow to develop different AI-based solution models.

Python Programming

This course introduces the core concepts of Python programming. The theory part includes an introduction to python and its properties, primitive data types, modules, functions, loops, and conditions. The laboratory portion is designed to provide students with the opportunity to work with a set of practical problems that Canadian businesses and industries have to resolve on a day-to-day basis.

Big Data Algorithms & Statistics

This course focuses on the algorithms used for big data processing and the statistical models used for data analytics. Students will use various algorithms for dimensionality reduction, data mining, big data analytics, and developing probability-driven models. They will use regression models, descriptive and inferential statistics, support vector machines, Bayesian networks, decision trees, k-means clustering, artificial neural networks, natural language processing, etc., to solve real-life problems. Students will optimize the problem-solving process by blending combinations of algorithms and models. Furthermore, they will use different visualization techniques to visualize the output of a model. Students will develop the analytic models, statistical models, and visualizations using R and Python.

Big Data Visualization for Business Communication

In this course, students deepen their understanding of best practices and tools for presenting data analyses aligned with business needs. They are introduced to the basics and principles of data visualization and will analyze quantitative and qualitative data to create meaningful visualizations, promoting organizational decision-making. Students will learn how to present the insights of their analysis to the target audience by creating visualizations incorporating the diverse viewpoints of the company stakeholders. They will learn to acquire, parse, and analyze large datasets and assess rhetoric presentation approaches, executive presence techniques, metrics, and strategic change practices to communicate better and motivate business stakeholders to act as a group. Students will use the highly demanding visualization tools Tableau, Power BI, and AWS to create meaningful and interactive visualizations.

Job Search & Success

This course provides student with skills and knowledge to help support their career search and succeed in the workplace. Students align their personal skill set and goals to guide them on their career paths. They will learn how to effectively conduct a job search, build a professional and well-tailored resume and cover letter, and develop and practice interview techniques. Students will also develop their personal brand to help support effective career networking and aid in their job search. Teamwork and collaboration in the workplace are also discussed. Self-reflection is used to inspire insight and support their professional career journey.


Advanced Python AI & ML Tools

This course introduces advanced concepts of Python programming language. The theory includes designing, implementing, and using APIs and advanced modules for AI and ML. The laboratory portion is designed to allow students to work with a set of practical problems and apply their knowledge to real-life software application challenges. Students use Python modules to interact with a database, analyze image datasets, create visualizations, and use PySpark to handle big datasets using the Apache Spark data processing framework.

Big Data Network

This course provides the students with practical Apache Spark and Apache Kafka skills. Students will learn how to use Apache Spark for Data Engineering and Machine Learning applications. They will have real-world experience with Spark MLlib and Spark Structured Streaming. They will also learn about Resilient Distributed Datasets, or RDDs, that enable parallel processing across the nodes of a Spark cluster. The course will also help the students to get started with the fundamental Kafka operations. Students will use the Kafka streaming platform to learn how to handle data in motion. They will be able to build their own Kafka producers and consumers.

Natural Language Processing & Social Media Analytics

This course introduces the fundamentals and key concepts of Natural Language Processing (NLP) and Social Media Analytics. Students will have hands-on experience in collecting and storing Social Media Data. They will use classic machine-learning algorithms to solve problems related to machine translation, language modeling, sequence tagging, and Social Media Analytics. To analyze social media data, students will use different AI, ML, and NLP tools. The course will provide the opportunity to build, train, and apply models that analyze social media data and generate valuable social, marketing, and business insights.

Neural Networks & Deep Learning

This course introduces the core concepts of neural networks. The theory part includes an introduction to the foundations of neural networks and key parameters in a neural network's architecture. The laboratory portion provides students with hands-on practices and the opportunity to build, train and apply fully connected neural networks.

Big Data Capstone Project

In the job market, employers usually look for team players with competence and expertise. They expect candidates to have experience with teamwork in project environments. This course should prepare students to work on concrete goals in a small team. They will develop an application or design to address a Big Data problem based on pre-set requirements. Students should apply the necessary project management skills to manage planning, deadlines, milestones and deliverables with a client. Appropriate documentation should supplement the design to cover the motivation, methods and test cases. Students are expected to capture reasonable market research and a business plan to cover the business aspects.

Work Term (Full-Time)

Co-operative education provides students with the opportunity to apply classroom learning to the workplace, undertake career sampling and gain valuable work experience that may assist students in leveraging employment after graduation. For further information regarding co-op, please refer to:

--- OR ---
WIL Project

Work Integrated Learning (WIL) Project is aimed at enriching students by connecting different program areas of study, cutting across subject-matter lines, and emphasizing unifying concepts. The focus of the WIL Project is to make connections between study and industry by engaging students in relevant and meaningful activities that are connected to and practiced within the professional workplace. WIL Project allows students to enhance and strengthen their employability prospects post-graduation by fine tuning skills and knowledge and meeting the expectations of today's employers. Students are required to attend the scheduled shifts in the WIL office, reporting to the WIL Supervisor. Weekly real-world challenges are presented in the WIL office, designed by industry professionals. In addition to the weekly assigned deliverables, students are also offered professional development sessions, and exposed to industry guest speakers, enhancing their opportunity to develop their professional network.

Co-op Eligibility & WIL Project Fee

In order to be eligible to secure an approved full-time co-op work term (CPL-1049), students must have a GPA of 2.8 or greater and complete all the co-op eligibility requirements. Failing to do so will require students to enroll in CPL-5559 WIL Project at an additional cost.


Centre for Global Engagement



Room C1-210

1457 London Road

Sarnia ON, N7S 6K4

After Graduation

Employment Opportunities

Data analyst sitting at computer typing on keyboard.

Graduates of the program may be employed in roles such as Data Analyst, Analytics Specialist, Business Analytic Specialist, Project Manager or related fields.

Looking for Support After Graduation?

The International Graduate Services & Support Centre (GSSC) is a place dedicated to assisting International alumni as they seek employment and settle into Canadian life following graduation.


About Co-op

Students in this program have the opportunity to gain valuable work experience by applying classroom learning during co-op experiences.

Learn more about co-op terms and the roles and responsibilities of students and co-op advisors.

More Information

Student Responsibilities

  • Course and program delivery schedules are proposed and subject to change for each intake.
  • Students are required to bring their own laptop with wireless capability.
  • Students are advised to bring an official copy of their most recent police clearance, driver's license, and vaccination record from their home country.
An international student smiling at desk on campus.
Set yourself up for success!

Technology Requirements

It is required that students purchase a laptop with a Windows operating system.

Internet Speed Requirements

For best performance for students learning remotely, an internet connection with a minimum of 40 Mbps download and 10 Mbps upload speed is recommended in order to effectively use video conferencing and remote lecture delivery software as well as, other online resources remotely. Due to the large area over which students may be dispersed, we are unable to recommend a specific provider, so you will need to inquire around your area to find one that best suits your needs.

Minimum Laptop Requirements

In order to access the internet and virtually-delivered software and courseware, student laptops should include the following at a minimum. By meeting the following specifications, students will be equipped to access software and courseware on their laptop through the internet:

  • Intel i5 8th Gen Processor or equivalent
  • 16 GB of RAM (with a minimum of 8 GB)
  • 100 GB HDD or more
  • HD Graphics
  • Webcam with a microphone
  • Wireless 802.11n/ac 5ghz capable
  • Windows Operating System (Windows 11)


To ensure students are getting the most our of their classroom experience, some software will be required.

Lambton College has made this software easily accessible online. Students can leverage our Microsoft Office 365 software packages and services. In addition, much of the software you require for your courses will be available on demand for use on any device - on or off campus.