'AI NLP Study' 카테고리의 글 목록

AI NLP Study

Coursera - Machine Learning_Andrew Ng - Week 3 (Logical Regression) 2022.01.26
Coursera - Machine Learning_Andrew Ng - Week 2 2022.01.25
Coursera - Machine Learning_Andrew Ng - Week 1 2022.01.25
Linear Algebra(1) 2022.01.23
Python Object-Oriented Programming (객체 지향 프로그래밍) 2022.01.10
Python 자료구조와 Pythonic Programming 2022.01.10
Python Overview, Variable & Operator 2022.01.07

Coursera - Machine Learning_Andrew Ng - Week 3 (Logical Regression)

2022. 1. 26. 17:56

Classification: y = 0 or y = 1

if hθ(x) ≥ 0.5, predict y=1

if hθ(x) < 0.5, predict y=0

⇒ logistic regression: 0 ≤ hθ(x) ≤ 1

Hypothesis Representation

- Sigmoid function (==logistic function)

(cf) hθ(x) = 0.7 ⇒ 70% chance of ~~~

Decision boundary

hθ(x) = g(θ0+θ1x1+θ2x2) ⇒ predict y=1 if -3+x1+x2 ≥ 0

Cost function

- How to choose parameter θ?

Simplified cost function and gradient descent

* convert the two lines into one line

Logistic regression cost function

Gradient Descent

*Looks same as linear regression!

BUT, hθ(x) are different! ==>

Multi-class classification (one-vs-all)

Sigmoid function VS softmax classifier

⇒ sigmoid: get percentage on how y might equal to 1 for each class

⇒ softmax: get the distribution of percentage of the classes

저작자표시 비영리 변경금지

'AI NLP Study > Machine Learning_Andrew Ng' 카테고리의 다른 글

Coursera - Machine Learning_Andrew Ng - Week 2 (0)	2022.01.25
Coursera - Machine Learning_Andrew Ng - Week 1 (0)	2022.01.25

Coursera - Machine Learning_Andrew Ng - Week 2

2022. 1. 25. 18:12

2022.01.25

Coursera - Machine Learning_Andrew Ng - Week 2

Multiple features(variables)

Gradient descent for multiple variables

Gradient descent in practice 1: Feature Scaling

- feature scaling

: simple trick to apply → make gradients run much faster and converge in a lot fewer other iterations.

: make sure features are on a similar scale ⇒ get every features into approximately a -1<=xi<=1 range

- mean normalization

Gradient Descent in practice 2: Learning rate

- Debugging: make sure gradient descent is working correctly

(use visualization using plot vs automatic convergence test)

If α is too small ⇒ slow convergence

If α is too big ⇒ J(θ) may not decrease on every iteration; may not converge

Features and Polynomial Regression

example of polynomial regression formula

Normal Equation formula

⇒ Compare with Gradient Descent

Gradient Descent	Normal Equation
needs to choose α	no need to choose α
needs many iterations	don't need iteratation
works well even when n is large	slow if n is very large + need to compute

저작자표시 비영리 변경금지

'AI NLP Study > Machine Learning_Andrew Ng' 카테고리의 다른 글

Coursera - Machine Learning_Andrew Ng - Week 3 (Logical Regression) (0)	2022.01.26
Coursera - Machine Learning_Andrew Ng - Week 1 (0)	2022.01.25

Coursera - Machine Learning_Andrew Ng - Week 1

2022. 1. 25. 17:35

Supervised learning

-classification vs regression(contiguous variables)

Unsupervised learning

-no answers given to the algorithm ⇒ computer automatically analyze

-cocktail party problem ⇒ 2 audio recordings → separate out the two voices ⇒ can be done with single line of code

⇒ [W,s,v] = svd((repmat(sum(x.*x,1),size(x,1),1).*x)*x’);

⇒ use “Octave” or “Matlab” ⇒ it’s faster

[Linear Regression]

Model Representation

-supervised learning has training set

-training set → learning algorithm

* hypothesis:

Cost Function

⇒ Goal: minimize J(θ0 , θ1) ⇒ global minimum

⇒ use contour plots/figures for visualization

⇒ linear line of h(x) is converted to a single point in cost function graph

Gradient Descent Algorithm

If is α too small ⇒ gradient descent can be slow (alpha = step size)

If is α too big ⇒ gradient descent fail to converge, or even diverge

α rate doesn’t need to decrease →automatically take smaller steps

Batch Gradient Descent: every step needs to calculate all training sets in batches

Review:

Although there is difficulty in understanding the whole process, particularly the gradient descent equation, I am fairly able to get the big picture and the important concepts of machine learning regarding supervised/unsupervised learning, model representation, cost function, and gradient descent algorithm.

I am currently able to follow the contents and able to solve the quiz in Coursera for each lecture without much difficulty, yet!

저작자표시 비영리 변경금지

'AI NLP Study > Machine Learning_Andrew Ng' 카테고리의 다른 글

Coursera - Machine Learning_Andrew Ng - Week 3 (Logical Regression) (0)	2022.01.26
Coursera - Machine Learning_Andrew Ng - Week 2 (0)	2022.01.25

Linear Algebra(1)

2022. 1. 23. 12:05

Linear combination = 기본변형 (eg. v1, 3v1-1v2+1/2v3)

Span: set of all linear combinations of the vectors (기본변형으로 만들 수 있는 모든 벡터들)

Linear combination을 통한 matrix multiplication의 inner product & outer product 계산 방법 존재

Linearly independent: only one solution (trivial solution)

Linearly dependent: other nontrivial solutions / linearly dependent set produces multiple possible linear combinations.

Subspace: a subset of R^n closed under linear combination

==> a subspace is always represented as Span{v1, …, vp}

Basis of a subspace: set of vectors that satisfies (1) fully spans the given subspace H (2)linearly independent

•eg. H = Span{v1, v2, v3} è Span{v1, v2} forms a plane, but v3=2v1+3v2 ∈ Span{v1, v2} è {v1, v2} is a basis of H, but not {v1, v2, v3} nor {v1} is a basis.

•Basis is not unique

•BUT, 어떤 종류의 basis라도 그 basis에 있는 벡터의 갯수=dimension은 unique!)

Column Space

(eg. Linearly dependent columns)

Rank of matrix A = dim Col A

Linear Tranformation(선형 변환)

저작자표시 비영리 변경금지

Python Object-Oriented Programming (객체 지향 프로그래밍)

2022. 1. 10. 14:34

Object-Oriented Programming

class Student(object):  # 클래스 선언부 (예약어 - 클래스 이름 - 부모 클래스)
    SCHOOL = 'GOORM'    # 클래스 속성 (Class attribute)

    def __init__(self, name: str, sid: int):     # 생성자 (현재 수정하고자 하는 객체=self)
        self.name = name                         # 속성 (Attribute)
        self.sid = sid
        self.classes = set()

    # 클래스 함수 (Method)
    def take_class(self, class_name: str) -> None:
        self.classes.add(class_name)

    def drop_class(self, class_name: str) -> None:
        self.classes.discard(class_name)

# 클래스 생성
gildong_hong = Student('Gildong Hong', 20224352)
# 속성 출력
printgildong_hong.name, "in", Student.SCHOOL)
# 메소드 실행
gildong_hong.take_class("CS101")
gildong_hong.take_class("CS202")
gildong_hong.drop_class("CS101")

# 출력 결과
# Gildong Hong in GOORM

Magic Method (매직 메소드)

(cf) __init__, __call__, ___length__, __getitem__ 을 많이 사용함

생성자 (__init__)

소멸자 (__del__)

Indexing 메소드 (__getitem__, __setitem__)

etc

#Indexing
class DoubleMapper(object):
    def __init__(self):
        self.mapping = {}
    def __getitem__(self, index):  # Indexing get
        return self.mapping.get(index, index * 2)
    def __setitem__(self, index, item):  # Indexing set
        self.mapping[index] = item
mapper = DoubleMapper()
print(mapper[10], mapper[1, 2]) # 20 (1, 2, 1, 2)
mapper[10] = 15
print(mapper[10], mapper[1, 2]) # 15 (1, 2, 1, 2)

#lengh 메소드
class Dataset:
	def __init__(self, data, times=3):
        self.data = data
        self.times = times
    def __len__(self):  # len(instance) 호출될 시 호출
        return len(self.data) * self.times
    def __getitem__(self, index):
        if index > len(self):
            raise IndexError()
        return self.data[index % len(self.data)]
dataset = Dataset([10, 2, 5, 2], times=5)
print(len(dataset)) # 20\

#Typing
class Student:
    def __init__(self, name: str, sid: int):
        self.name = name
        self.sid = sid
    def __str__(self): # str 형변환
        return self.name + '_' + str(self.sid)
gildong_hong = Student("Gildong Hong", 20224372)
print(gildong_hong)  # Gildong Hong_20224372

#함수화 메소드(__call__)
class AdditionNumber(object):
    def __init__(self, number: int):  # 생성자
        self.number = number

    # 함수화 메소드
    def __call__(self, number: int): # *args, **kargs도 받을 수 있다.
        return number + self.number

addition_5 = AdditionNumber(5)
print(addition_5(10))  # 15 ==> 객체를 함수처럼 사용
                       # (cf) add.__call__(*args)

# iterable
seq = [1, 2, 3, 4, 5]
for elem in seq:
    print(elem)
print()

seq = list([1, 2, 3, 4, 5])
iterable = iter(seq)    #iter 내장함수(__iter__호출)
while True:
    try:
        elem = next(iterable)   #next 내장함수(__next__호출)
    except StopIteration:
        break

#context manager
class Student:
    def __init__(self, name, sid):
        self.name = name
        self.sid = sid

    def __enter__(self):  # with 구문에 들어갈 때 사용, return 값이 as 이하로 할당
        self.classes = set()
        return self

    def __exit__(self, exc_type, exc_value, trace):  # with 구문 나갈 때 사용
        self.classes.clear()

gildong_hong = Student("Gildong Hong", 20224372)
with gildong_hong:
    gildong_hong.classes.add('CS201')
with Student("Gildong Hong", 20224372) as gildong_hong:
    gildong_hong.classes.add('CS201')

print(gildong_hong.name, gildong_hong.sid)  # Gildong Hong 20224372

# getter & setter 사용하기
class Circle(object):
    PI = 3.141592
    def __init__(self, raidus=3.):
self.radius = raidus
    def get_area(self):
        return Circle.PI * self.radius ** 2
    def set_area(self, value):
        self.radius = (value / Circle.PI) ** .5

circle = Circle(5.)
print(circle.get_area())
circle.set_area(10)
print(circle.radius)

# property 사용하기 (@ 사용)
class Circle(object):
    PI = 3.141592
    def __init__(self, raidus=3.):
        self.radius = raidus
    @property
    def area(self):
        return Circle.PI * self.radius ** 2
    @area.setter
    def area(self, value):
        self.radius = (value / Circle.PI) ** .5

circle = Circle(5.)
print(circle.area)
circle.area = 10.
print(circle.radius)

Inheritance & Polymorphism

명시적인 private & protected 범위가 없음 → 모두 public

• private 변수/함수 이름 앞에 “__”를 붙임 (밑줄 2개)

Ex) self.__name, self.__sid

• protected 변수/함수 이름 앞에 “_”를 붙임 (밑줄 1개)

Ex) self._name, self._sid

class Student:
    def __init__(self, name: str, sid: int):
        self.name = name
        self.sid = sid
        self.classes = []

    def __str__(self):
        return self.name + "_" + str(self.sid)

    def take_class(self, class_name: str) -> None:
        self.classes.append(class_name)

class Master(Student):  # Student 상속
    def __init__(self, name: str, sid: int, professor: str):
        super().__init__(name, sid)  # 부모 클래스 생성자 접근, 정해진 부르는 타이밍은 없다.
        self.professor = professor

    def __str__(self):  # __str__ 재정의 → 다형성
        return super().__str__() + "_" + str(self.professor)

master = Master('Gildong Hong', 20224375, 'Prof Shin')
print(master)   # Gildong Hong_20224375_Prof Shin
print(super(Master, master).__str__()) # Gildong Hong_20224375 ==> super로 언제나 원하는 상위 클래스로 변환

# function visibility
class TestClass(object):
    def __init__(self):
        self.attr = 1 # Public 
        self._attr = 2 # Protected 
        self.__attr = 3 # Private
instance = TestClass()
print(dir(instance))
# 출력 결과
#['_TestClass__attr', '__class__', '__delattr__', 
# '__dict__', '__dir__', '__doc__', '__eq__', '__format__', 
# '__ge__', '__getattribute__', '__gt__', '__hash__', '__init__', 
# '__init_subclass__', '__le__', '__lt__', '__module__', '__ne__', 
# '__new__', '__reduce__', '__reduce_ex__', '__repr__', '__setattr__', 
# '__sizeof__', '__str__', '__subclasshook__', '__weakref__', '_attr', 'attr']

저작자표시 비영리 변경금지

'AI NLP Study > Python' 카테고리의 다른 글

Python 자료구조와 Pythonic Programming (0)	2022.01.10
Python Overview, Variable & Operator (0)	2022.01.07

Python 자료구조와 Pythonic Programming

2022. 1. 10. 14:21

Data Structure

Lists

: 아무 타입이나 넣기 가능 + 길이가 정해져 있지 않음

List indexing: seq[index] 형태로 요소 하나 접근
List slicing: seq[start: end: step] 형태로 List 자르기 (end포함X)

Reserved Words VS Built-in functions VS Methods

문법적 요소, 괄호X, 재정의 불가능  del, if-else, assert
기본 정의된 함수, 별개 함수 사용, 재정의 가능, 편의성 향상 --> len(), range(), sum()
객체 내 정의된 함수, .methods()로 접근, 해당 객체를 다룸 --> .append(), .insert(), .extend()

Tuple

Immutable list(불변 타입 리스트)
( ) 를 사용
함수에서 2개 이상 요소를 반환할 때 사용
Tuple안의 요소는 가변 타입일 수도 있음
String 타입일 때 일종의 문자 tuple로 생각 가능

Packing and Unpacking

Unpacking: * (Asterisk)로 남는 요소를 리스트로 남기기 가능

Dictionary

매핑을 위한 데이터 구조 (key , value)
Key: 불변 타입만! + 중복이 불가능
Set: Dictionary의 Key만 집합처럼 모여 있는 형태

Conditional Statements

if-elif-else
삼항 연산자: [Value1] if [Condition] else [Value2]

Loop

While Statement: 조건을 만족하는 동안 출력
For Statement: 주어진 객체를 순환 (for [Element] in [Iterable])
break: 가장 바깥쪽의 반복문 나가기
continue: 가장 바깥의 반복문의 처음으로 되돌아가기

Scope

Global scope: 최상위에 선언 == 다른 파일에서도 접근 가능
Local scope: 함수 안에 선언 == 상위함수에서 접근 불가

Variable Capture

상위 값이 바뀌면 하위 값이 바뀜
파라미터가 없는데도 불구하고 함수내에서 바뀌어버림 --> 따라서, 파라미터를 써줘야 버그가 발생이 없어진다.
Closure = Factory형식
파이썬에서 함수 = 일반 객체
변수로 할당 가능

Decorator

함수 하나를 인자로 받아  같은 형태의 함수를 반환하는 함수
@을 사용
Class를 사용할 시 Decorator에 인자 추가가 가능

Parameters

Function Parameters
Variable Length Parameter
Keyword Variable Length Parameter

Pythonic Programming

Comprehension

result = [i * 2 for i in range(10)] // append
result = {str(i): i for i in range(10)} // key:value
result = {str(i) for i in range(10)} // add()
evens = [i for i in range(100) if i % 2 == 0] // 원하는 요소만 추가
result = [(i, j) for i in range(5) for j in range(i)] // 이중 for문
eye = [[int(i == j) for j in range(5)] for i in range(5)] // 다차원 배열 만들기

Generator

요소를 하나씩 생성해서 반환하는 객체
Function에 yield를 사용하면 Generator가 된다
괄호로 generator comprehension 형태로 선언 가능
even_generator = (i * 2 for i in range(100))

Lambda Function

함수 이름 없이 빠르게 만들어 쓸 수 있는 함수 --> add = lambda a, b : a+b
seq = [6, -2, 8, 4, -5]  list(map(lambda x: x * 2, seq)) --> [12, -4, 16, 8, -10]
seq = [6, -2, 8, 4, -5]  list(filter(lambda x: x > 0, seq)) --> [6, 8, 4]

저작자표시 비영리 변경금지

'AI NLP Study > Python' 카테고리의 다른 글

Python Object-Oriented Programming (객체 지향 프로그래밍) (0)	2022.01.10
Python Overview, Variable & Operator (0)	2022.01.07

Python Overview, Variable & Operator

2022. 1. 7. 00:34

Python Overview

Features of python

플랫폼 독립적인 인터프리터 언어
완전 객체 지향 언어
동적 타이핑 언어

Variable & Operator

How to Name Variables

알파벳, 숫자, 언더스코어(_)로 선언
변수명은 그 변수의 특징이 잘 살아 있게 하자(가독성)
변수명은 대소문자가 구분
변수명으로 쓸 수 없는 예약어가 존재

Out-place VS In-place 연산

Out-place: 명시적으로 새로운 객체 생성 // a = a + 1
In-place: 기존 객체를 수정 시도하고, 불가능할 시 새로운 객체 생성 // a += 1

Primitive Data Types(Immutable / Mutable Types)

Immutable Type (불변 타입)이다
Python의 모든 것은 객체  Primitive Data Type 들 역시 객체
*불변 타입들은 저장된 값이 변하지 않는다!
모든 타입은 Physical Memory 주소를 가르침
Primitive Data Type과 Tuple을 제외한 다른 모든 파이썬 객체는 Mutable Type (가변 타입)

파이썬에서 대입은 메모리 주소 복사의 원칙적

값을 복사하지 않고 같은 주소를 공유
immutable인데 수정이 필요하면 새로운 객체 생성

primitive data 크기에 따른 객체 할당 방법

흔한 객체는 기존 객체를 들고 온다
복잡한 값을 가지면 객체를 새로 형성

Types

* Dynamic Typing:

데이터 타입은 코드 실행 지점에서 정해진다 // a = 10 VS int a = 10

* Implicit Type Conversion

: bool → int → float → complex 순서로 타입이 정해진다

* Explicit Type Conversion

: [Type]([value])로 명시적 형 변환

* Type Checking

: type(), isinstance([variable], [type])

저작자표시 비영리 변경금지

'AI NLP Study > Python' 카테고리의 다른 글

Python Object-Oriented Programming (객체 지향 프로그래밍) (0)	2022.01.10
Python 자료구조와 Pythonic Programming (0)	2022.01.10

PREV 1 NEXT

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

AI NLP Study

'AI NLP Study > Machine Learning_Andrew Ng' 카테고리의 다른 글

'AI NLP Study > Machine Learning_Andrew Ng' 카테고리의 다른 글

'AI NLP Study > Machine Learning_Andrew Ng' 카테고리의 다른 글

Object-Oriented Programming

Magic Method (매직 메소드)

Inheritance & Polymorphism

'AI NLP Study > Python' 카테고리의 다른 글

Lists

Reserved Words VS Built-in functions VS Methods

Tuple

Packing and Unpacking

Dictionary

Conditional Statements

Loop

Scope

Variable Capture

Decorator

Parameters

Pythonic Programming

Comprehension

Generator

Lambda Function

'AI NLP Study > Python' 카테고리의 다른 글

Python Overview

Features of python

Variable & Operator

How to Name Variables

Out-place VS In-place 연산

Primitive Data Types(Immutable / Mutable Types)

파이썬에서 대입은 메모리 주소 복사의 원칙적

primitive data 크기에 따른 객체 할당 방법

Types

'AI NLP Study > Python' 카테고리의 다른 글

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역