전체 글167 Root Cause Analysis * Root and Cause Analysis , Continuous Improvement - To prevent recurrence at lowest cost in the simplest way 발본색원의 개념, 근원이 되는 문제를 제거해서 비슷한 문제들이 다시는 발생하지 않도록 방지하는 것 - method to trace down RC: data mining hierachial cluster solution (GT data mining) 1) Root cause: removed; prevent undesireable outcome,the condition that enable one or more causes 2) Casual cause: affect an event's outcome, but not.. 2024. 3. 25. Dissimilarity Matrix - Dissimilarity Matrix (n objects x p attributes) : Store a collection of proximities that are available for all pairs of n objects A triangular matix, i.e., d(i,j) = d(j,i) d(i,j) is the measured dissimilarity or "differce" between objects i and j similarity measure sim(i,j) = 1 - d(i,j) d(i,j) is a non-negative number that is close to 0 when objects i and j are highly similar or "near" each ot.. 2024. 3. 25. Basic Statistical Descriptions of Data - Quartiles : Q1 (25th percentile), Q3 (75th percentile) - Inter-quartile range: IQR = Q3- Q1 - Five number summary : min, Q1, median, Q3, max - Boxplot : ends of the box are the quartiles; median is marked ; add whiskers (minimum and maximum observation) and plot outliers individually ; whiskers : 박스 바깥쪽에 가로로 나있는 선, 최소값과 최대값을 표시 - Outlier : usually, a value higer/ lower than 1.5 x IQR ex) 30,.. 2024. 3. 25. Data Set, attributes - Data set 은 data objects로 이루어져 있고 data object는 entity 를 표현한다. ex) sales database: customers, store items, sales medical database: patients, treatments - Data objects는 attributes에 의해 설명된다. - Database의 row는 data objects로 columns는 attributes를 나타낸다. - Attribute 의 종류에는 1) Nominal 2) Binary 3) Ordinal 4) Numeric: quantitative 가 있다. 1) Nominal : categories, states or symbols - Hair_color = {auburn, bl.. 2024. 3. 25. Interestingness Measure: Correlation Lift * Measure of dependent/correlated events: lift 1) Statistiacal independence P(S∩B) = P(S) x P(B) => Statistically independence P(S∩B) > P(S) x P(B) => Postively correlated P(S∩B) Negatively correlated 2) lift : interestingness measure: (interesting = reasonable 정도로 이해하면 될 것 같다) - if the lift is equal to 1, then A and B are independent and there is no correlation between them. A⊥.. 2024. 3. 25. ECLAT ECLAT: Frequent Pattern Mining with Vertical Data Format * 기존의 Tid 순으로 표현했던 것을 각각의 아이템별로 Tid list 를 표현해낸다. - (K+1) 의 itemset 의 후보가 되려면 모든 K-item subset 이 frequent 해야한다. - Apriori 와는 달리 1) support 를 계산 하는데 짧은 시간이 걸린다. 2) 교집합을 찾기가 쉽다. 하지만 intermediate Tid 리스트가 너무 많은 메모리를 차지 할 수 있다. ex) minimum support count = 2 2024. 3. 25. QALY (Quality - adjusted life year) QALY 에 대한 위키피디아의 정의는 아래와 같다. The quality-adjusted life year or quality-adjusted life-year (QALY) is a measure of disease burden, including both the quality and the quantity of life lived. It is used in assessing the value for money of a medical intervention. According to Pliskin et al., The QALY model requires utility independent, risk neutral, and constant proportional tradeoff behaviour. The Q.. 2024. 3. 5. Critical Path Analysis Critical Path Analysis ex) 1. ES (earliest start) - Earliest start time given predecessor activities 2. EF (earliest finish) - Earliest possible completion time 3. LS (latest start) - Latest start time as to not delay entire project 4. LF (latest finish) - Latest finish time as to not delay entire project 1) Forward Pass - Proceed through diagram from strat to finish 다이아그램의 시작부터 시간을 계산해 나감 - ES.. 2024. 3. 5. Bracket median method Bracket Median Method. Continuous distribution을 discrete approximation 하는 방법 2번째 (Tukey method에 이어) - The bracket median method of interval [a,b] is a value m* between a and b such that P(a 2024. 3. 5. EVPI (expected value of perfect information) Expected Value of Perfect Information. : quantity as the maximum amount that the investor should be willing to pay the clairvoyant for perfect information. (출처: Figure. Making hard decision chapter12 pg.440) 그림을 보면, 정보가 없는 decision node에서 나오는 EMV는 high risk 일때 580이다. Consult Clairvoyant, 즉, perfect information 이 제공되는 경우, investor의 행동을 보면 market이 up 일때는 high rist stock을, market이 flat일때는 savings a.. 2024. 3. 5. certainty equivalent certainty equivalent : 확실성등가란 위험있는 수익흐름에 대하여 그 위험을 부담하는 대신 보다 적은 수익이라도 확실하게 실현될 수 있다면 그와 맞바꿀 수 있는 최소한의 가격 (네이버 지식백과) : the amount of money that is equivalent in your mind to a given situation that involves uncertainty. (making hard decision 2nd edi) 예를 들어 이해해 보도록하자, win $2,000 with probability 0.50, lose $20 with probability 0.50 인 복권의 상황을 가정해보자. 만약 친구가 나 대신 위의 복권을 사려고 한다. 얼마에 저 기회를 양보하겠는가? $300.. 2024. 3. 5. Extended Pearson-Tukey The easiest way to use a continuous distribution in a decision tree or influence diagram is to approximate it with a discrete distribution. The basic idea is to find a few representative points in the distribution and then to assign those points specific probability value. A particularly simple approach, from Keefer and Bodily (1983), is called the extended Pearson-Turkey method. (Source: Making.. 2024. 3. 5. 이전 1 ··· 10 11 12 13 14 다음