Praveen Kumar K C
Thursday, December 21, 2023
Wednesday, December 20, 2023
22BCS306A-Object Oriented Programming with JAVA
1. Syllabus
2. Text Book :
3. Notes & Presentations :
( Materials Courtesy - Thanks to : https://sites.google.com/site/harishbitcse/lecture-notes/22bcs306a-object-oriented-programming-with-java )
Tuesday, November 7, 2023
Big Data Tutorial – Learn Big Data from Scratch and Become a Pro
Are you ready to embark on a journey to unlock the secrets of Big Data? Our Big Data tutorial is the perfect starting point for anyone looking to master the art of big data analysis. Imagine being able to process, analyze and visualize vast amounts of data with ease. Imagine being able to turn data into actionable insights that drive business success. With our comprehensive tutorial, you’ll learn how to do just that.
What is Big Data?
Big Data is data that cannot be managed with traditional databases. Gartner's definition describes it as data sets of huge volume, generated in many varieties and at high velocity. These three properties are known as the 3 Vs of Big Data.
These very large volumes of data can be used to discover patterns and address business problems that could not be handled earlier.

Start Learning Big Data
Explore these Big Data tutorials and master the different technologies of Big Data. Choose where to begin, learn at your own pace:
Beginner:
- Big Data History and Use Cases
- Benefits of Big Data
- Why Learn Big Data?
- Top Big Data Tools
- Why Big Data is popular?
- Rise of Big Data
- Big Data Career Path
- Why choose a Career in Big Data?
- Careers and Job Roles in Big Data
- Big Data Job Roles
- Switch your career to Big Data
- Big Data Developer Skills
- Big Data Use Cases – Hadoop, Spark and Flink
- Apache Flink – A Big Data Platform
- Big Data for Union Budget
Intermediate:
- Big Data Case Studies
- Vulnerability – Introducing 10th V of Big Data
- Lambda Architecture
- Big Data And Cloud Computing
- Big Data Analytics Tools
Expert:
- Big Data Trends
- Big Data Quotes
- Big Data Vs Cloud Computing
- Use of Big Data and Cloud Computing
- Why You Should Move From Java to Big Data?
- Why Salesforce has acquired Tableau?
- Mainframe to Big Data
Big Data Analytics- BDA - 18CS72 notes
Big Data Analytics Materials: BDA - 18CS72
( Note: Thanks to the original author...!!! )
ML - Candidate Elimination Algorithm :
The candidate elimination algorithm incrementally builds the version space given a hypothesis space H and a set E of examples. The examples are added one by one; each example possibly shrinks the version space by removing the hypotheses that are inconsistent with the example. The candidate elimination algorithm does this by updating the general and specific boundary for each new example.
- It can be seen as an extension of the Find-S algorithm.
- It considers both positive and negative examples.
- Positive examples are handled as in Find-S: they generalize the specific boundary just enough to cover the example.
- Negative examples specialize the general boundary just enough to exclude the example.
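The boundary updates described above can be sketched in Python. This is a minimal illustration for conjunctive hypotheses over discrete attributes, not the original author's program. The names `covers`, `more_general_or_equal` and `candidate_elimination` are assumptions, `'0'` stands in for the "no value" symbol, and each attribute's domain is assumed to be the set of values seen in the data. The training rows are the four examples that appear in the Find-S output later in this post.

```python
# Training rows copied from the Find-S sample output in this post.
# The last column is the label ("Yes" = positive example).
DATA = [
    ["Sunny", "Warm", "Normal", "Strong", "Warm", "Same", "Yes"],
    ["Sunny", "Warm", "High", "Strong", "Warm", "Same", "Yes"],
    ["Rainy", "Cold", "High", "Strong", "Warm", "Change", "No"],
    ["Sunny", "Warm", "High", "Strong", "Cool", "Change", "Yes"],
]


def covers(h, x):
    """Return True if hypothesis h accepts instance x ('?' matches anything)."""
    return all(hv == "?" or hv == xv for hv, xv in zip(h, x))


def more_general_or_equal(h1, h2):
    """Return True if h1 accepts every instance that h2 accepts."""
    if "0" in h2:  # h2 accepts nothing, so anything is at least as general
        return True
    return all(a == "?" or a == b for a, b in zip(h1, h2))


def candidate_elimination(data):
    n = len(data[0]) - 1
    # Assumption: each attribute's domain is the set of values seen in the data.
    domains = [sorted({row[i] for row in data}) for i in range(n)]
    S = [("0",) * n]  # specific boundary ('0' plays the role of "no value")
    G = [("?",) * n]  # general boundary

    for row in data:
        x, positive = tuple(row[:-1]), row[-1] == "Yes"
        if positive:
            # Drop general hypotheses that reject the positive example.
            G = [g for g in G if covers(g, x)]
            new_S = []
            for s in S:
                if not covers(s, x):
                    # Minimal generalization of s that accepts x.
                    s = tuple(
                        xv if sv == "0" else (sv if sv == xv else "?")
                        for sv, xv in zip(s, x)
                    )
                if any(more_general_or_equal(g, s) for g in G):
                    new_S.append(s)
            S = new_S
        else:
            # Drop specific hypotheses that accept the negative example.
            S = [s for s in S if not covers(s, x)]
            new_G = []
            for g in G:
                if not covers(g, x):
                    new_G.append(g)
                    continue
                # Minimal specializations: fix one '?' to a value other than x's.
                for i in range(n):
                    if g[i] != "?":
                        continue
                    for value in domains[i]:
                        spec = g[:i] + (value,) + g[i + 1:]
                        if value != x[i] and any(
                            more_general_or_equal(spec, s) for s in S
                        ):
                            new_G.append(spec)
            new_G = list(dict.fromkeys(new_G))  # remove duplicates, keep order
            # Keep only the maximally general members.
            G = [
                g for g in new_G
                if not any(h != g and more_general_or_equal(h, g) for h in new_G)
            ]
    return S, G


S, G = candidate_elimination(DATA)
print("S:", S)
print("G:", G)
```

On these four rows the sketch ends with the specific boundary ('Sunny', 'Warm', '?', 'Strong', '?', '?') and a general boundary of two hypotheses, one requiring only Sky = Sunny and one requiring only AirTemp = Warm.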
Terms Used:
- Concept learning: The task of learning a concept (a yes/no classification) from labeled training data.
- General Hypothesis: Places no constraint on any attribute, so it accepts every instance.
- G = {'?', '?', '?', '?', …}: one '?' per attribute.
- Specific Hypothesis: Constrains the attributes to particular values; the most specific hypothesis accepts no instance at all.
- S = {'ϕ', 'ϕ', 'ϕ', …}: one ϕ per attribute.
- Version Space: The set of hypotheses lying between the general hypothesis and the specific hypothesis. It is not a single hypothesis but the set of all hypotheses consistent with the training data set.
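To make the idea of a version space concrete, the following sketch simply enumerates every conjunctive hypothesis and keeps the ones that agree with all training rows. It is a brute-force illustration under stated assumptions, not an efficient method: the helper names `covers`, `consistent` and `version_space` are hypothetical, attribute domains are taken from the values seen in the data, and the rows are the four training examples from the Find-S output later in this post.

```python
from itertools import product

# Training rows copied from the Find-S sample output in this post.
# The last column is the label ("Yes" = positive example).
DATA = [
    ["Sunny", "Warm", "Normal", "Strong", "Warm", "Same", "Yes"],
    ["Sunny", "Warm", "High", "Strong", "Warm", "Same", "Yes"],
    ["Rainy", "Cold", "High", "Strong", "Warm", "Change", "No"],
    ["Sunny", "Warm", "High", "Strong", "Cool", "Change", "Yes"],
]


def covers(h, x):
    """Return True if hypothesis h accepts instance x ('?' matches anything)."""
    return all(hv == "?" or hv == xv for hv, xv in zip(h, x))


def consistent(h, data):
    """Return True if h labels every training row correctly."""
    return all(covers(h, row[:-1]) == (row[-1] == "Yes") for row in data)


def version_space(data):
    """Enumerate all hypotheses and keep those consistent with the data."""
    n = len(data[0]) - 1
    # Assumption: each attribute may take any value seen in the data, or '?'.
    choices = [sorted({row[i] for row in data}) + ["?"] for i in range(n)]
    return [h for h in product(*choices) if consistent(h, data)]


for h in version_space(DATA):
    print(h)
```

For these four rows the enumeration leaves six hypotheses, ranging from the most specific one consistent with the data to the most general ones. Candidate elimination represents the same set through its two boundaries instead of listing every member.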
Advantages of CEA over Find-S:
- Improved accuracy: CEA considers both positive and negative examples when forming hypotheses, so the result is constrained from both the specific and the general side. Note that, like Find-S, the basic algorithm assumes noise-free training data.
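- Flexibility: CEA returns every hypothesis consistent with the data rather than a single one, so it shows when the training examples do not yet determine the concept uniquely. The basic algorithm, however, learns a single two-class (positive/negative) concept.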
- More efficient: CEA reduces the number of hypotheses by generating a set of general hypotheses and then eliminating them one by one. This can result in faster processing and improved efficiency.
- Better handling of continuous attributes: The boundary idea can be extended to continuous attributes by keeping interval bounds for each attribute, although the basic algorithm described here works with discrete values.
Disadvantages of CEA in comparison with Find-S:
- More complex: CEA is a more complex algorithm than Find-S, which may make it more difficult for beginners or those without a strong background in machine learning to use and understand.
- Higher memory requirements: CEA requires more memory to store the set of hypotheses and boundaries, which may make it less suitable for memory-constrained environments.
- Slower processing for large datasets: CEA may become slower for larger datasets due to the increased number of hypotheses generated.
- Higher potential for overfitting: The increased complexity of CEA may make it more prone to overfitting on the training data, especially if the dataset is small or has a high degree of noise.
ML - Find S Algorithm
Introduction:
The Find-S algorithm is a basic concept learning algorithm in machine learning. It finds the most specific hypothesis that fits all the positive examples. Note that the algorithm considers only the positive training examples. Find-S starts with the most specific hypothesis and generalizes it each time it fails to cover an observed positive training example. Hence, Find-S moves from the most specific hypothesis toward more general ones.
Important Representation :
- ? indicates that any value is acceptable for the attribute.
- A single specific value (e.g., Cold) indicates that only that value is acceptable for the attribute.
- ϕ indicates that no value is acceptable.
- The most general hypothesis is represented by: {?, ?, ?, ?, ?, ?}
- The most specific hypothesis is represented by: {ϕ, ϕ, ϕ, ϕ, ϕ, ϕ}
Steps Involved in Find-S:
- Start with the most specific hypothesis: h = {ϕ, ϕ, ϕ, ϕ, ϕ, ϕ}
- Take the next example; if it is negative, make no change to the hypothesis.
- If the example is positive and the current hypothesis is too specific to cover it, generalize the hypothesis just enough to cover the example.
- Repeat the above steps until all the training examples have been processed.
- After all the training examples have been processed, the final hypothesis can be used to classify new examples.
Output:
['Sunny', 'Warm', 'Normal', 'Strong', 'Warm', 'Same', 'Yes']
['Sunny', 'Warm', 'High', 'Strong', 'Warm', 'Same', 'Yes']
['Rainy', 'Cold', 'High', 'Strong', 'Warm', 'Change', 'No']
['Sunny', 'Warm', 'High', 'Strong', 'Cool', 'Change', 'Yes']
The most general hypothesis: ['?', '?', '?', '?', '?', '?']
The most specific hypothesis: ['0', '0', '0', '0', '0', '0']
Find S: Finding a maximally specific hypothesis
The training example no: 1 the hypothesis is: ['Sunny', 'Warm', 'Normal', 'Strong', 'Warm', 'Same']
The training example no: 2 the hypothesis is: ['Sunny', 'Warm', '?', 'Strong', 'Warm', 'Same']
The training example no: 3 the hypothesis is: ['Sunny', 'Warm', '?', 'Strong', 'Warm', 'Same']
The training example no: 4 the hypothesis is: ['Sunny', 'Warm', '?', 'Strong', '?', '?']
The maximally specific hypothesis for training set is ['Sunny', 'Warm', '?', 'Strong', '?', '?']
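The Find-S steps can be sketched in Python as follows. This is an illustrative version, not the program that produced the output above. The function name `find_s` is an assumption, `'0'` stands in for ϕ as it does in that output, and the rows are the four training examples listed there.

```python
# Training rows as listed in the output above; the last column is the label.
DATA = [
    ["Sunny", "Warm", "Normal", "Strong", "Warm", "Same", "Yes"],
    ["Sunny", "Warm", "High", "Strong", "Warm", "Same", "Yes"],
    ["Rainy", "Cold", "High", "Strong", "Warm", "Change", "No"],
    ["Sunny", "Warm", "High", "Strong", "Cool", "Change", "Yes"],
]


def find_s(data):
    """Return the maximally specific hypothesis covering all positive rows."""
    n = len(data[0]) - 1
    h = ["0"] * n  # most specific hypothesis: no value is acceptable
    for row in data:
        *x, label = row
        if label != "Yes":
            continue  # negative examples leave the hypothesis unchanged
        for i, value in enumerate(x):
            if h[i] == "0":
                h[i] = value  # first positive example: adopt its value
            elif h[i] != value:
                h[i] = "?"  # conflicting values: accept any value
    return h


print(find_s(DATA))  # ['Sunny', 'Warm', '?', 'Strong', '?', '?']
```

The result agrees with the maximally specific hypothesis reported in the output above. Because negative rows are skipped, the third example has no effect on the hypothesis.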
Input data Set:
Tuesday, November 20, 2018
A Complete Guide to Mastering Python
Getting Started with Python
Things to Learn




