Разработчик BigData (BigData Developer)

Total Size

15.3 GB

Total Files

457

Hash

3309E50DA1C228721C1C506E410E5BDA56F0504B

/data_gathering/data_gathering/

__init__.py

0.0 KB

vkstat.cfg

0.2 KB

gathering.py

7.6 KB

scrapped_data.txt

50.1 KB

/data_gathering/data_gathering/storages/

__init__.py

0.0 KB

storage.py

0.3 KB

file_storage.py

1.0 KB

/data_gathering/data_gathering/scrappers/

__init__.py

0.0 KB

scrapper.py

0.9 KB

/data_gathering/data_gathering/parsers/

__init__.py

0.0 KB

filter_parser.py

0.5 KB

test_parsers.py

0.5 KB

html_parser.py

0.6 KB

parser.py

0.6 KB

/.../lecture_27_mr/ClickStreamExtra/

stopwords.txt

0.0 KB

run.sh

0.3 KB

ClickStreamExtra.java

3.9 KB

/.../lecture_28_pipelines/docker-airflow/

.dockerignore

0.0 KB

circle.yml

0.3 KB

.gitignore

0.5 KB

docker-compose-LocalExecutor.yml

0.8 KB

Dockerfile

2.3 KB

docker-compose-CeleryExecutor.yml

2.9 KB

README.md

6.0 KB

LICENSE

11.4 KB

/.../lecture_27_mr/WordCountSimple/

file01

0.0 KB

file02

0.0 KB

run.sh

0.2 KB

WordCount.java

2.2 KB

/.../lecture_28_pipelines/docker-airflow/.git/

HEAD

0.0 KB

description

0.1 KB

config

0.3 KB

index

0.9 KB

packed-refs

1.6 KB

/3. Визуализация/homework/vkstatsbot/

_gitignore

0.0 KB

vkstat_example.cfg

0.0 KB

constants.py

0.3 KB

visualisation.py

0.9 KB

bot_handlers.py

1.5 KB

vk_api.py

2.3 KB

main.py

2.4 KB

/homework/vkstatsbot/

.gitignore

0.0 KB

vkstat_example.cfg

0.0 KB

constants.py

0.3 KB

visualisation.py

0.9 KB

bot_handlers.py

1.5 KB

vk_api.py

2.3 KB

main.py

2.4 KB

/.../lecture_28_pipelines/docker-airflow/.git/refs/remotes/origin/

HEAD

0.0 KB

/.../lecture_28_pipelines/docker-airflow/.git/refs/heads/

master

0.0 KB

/18. Анализ текстовых данных 2/

requirements.txt

0.0 KB

lecture_18_text.ipynb

174.9 KB

LDA.ipynb

478.1 KB

lesson18.mp4

265.1 MB

/30. Организация хранения данных для решения задач машинного обучения/

sample.txt

0.1 KB

connect

0.3 KB

sample.scala

0.4 KB

start_jupyter.sh

0.6 KB

word_count.py

0.9 KB

titanic.ipynb

21.4 KB

test.csv

28.6 KB

train.csv

61.2 KB

lecture_30_spark.pdf

402.4 KB

BigData-2018-03 2018 07 26 20 07 43.mp4

578.6 MB

/31. Spark/

broad.py

0.1 KB

examples.scala

1.0 KB

spark_hw.pdf

29.1 KB

lecture_31_spark.pdf

403.8 KB

BigData-2018-03 2018 07 31 20 03 46.mp4

895.0 MB

/25. Процесс CRISP-DM. Выбор хранилища, запросы к базе (Реляционная, нереляционная). Большие данные и параллельные вычисления/

test_env.sh

0.1 KB

text_3.txt

0.1 KB

text_1.txt

0.1 KB

mapper.py

0.1 KB

reducer.py

0.2 KB

text.txt

0.2 KB

text_2.txt

0.3 KB

run.sh

0.3 KB

homework-watermark.pdf

54.3 KB

alice.txt

151.9 KB

lecture_25-watermark.pdf

2.9 MB

BigData-2018-03 2018 07 10 20 02 49.mp4

813.0 MB

/.../lecture_28_pipelines/luigi/

run.sh

0.1 KB

luigi_mr_conf.cfg

0.1 KB

luigi_graph.py

1.1 KB

luigi_mr.py

1.3 KB

/15. Бустинг/

ДЗ.txt

0.1 KB

otus.png

17.7 KB

homework.ipynb

22.3 KB

lecture_15_ens_lib.ipynb

302.5 KB

BigData-2018-03 2018 05 29 20 01 07.mp4

283.6 MB

/11. Уменьшение размерности/

ДЗ.txt

0.1 KB

chat.txt

5.3 KB

homework_dimred.ipynb

6.9 KB

proj_1.png

7.1 KB

dim_var.png

13.5 KB

otus.png

17.7 KB

PearsonFig.jpg

18.1 KB

pca.png

27.2 KB

proj.png

50.9 KB

data.csv

125.2 KB

dims.png

129.3 KB

lecture_11_dimred.ipynb

1.5 MB

orders.csv

76.3 MB

lesson11.mp4

716.8 MB

/lecture_19/

requirements.txt

0.1 KB

homework.txt

3.3 KB

netflix_progress.jpg

10.1 KB

star_ratings.png

16.2 KB

otus.png

17.7 KB

mf.png

21.5 KB

svd.png

25.7 KB

effect_factorizations.png

28.8 KB

linkedin.png

33.0 KB

lecture_19_rec.ipynb

37.9 KB

lecture_19_rec_p2.ipynb

55.4 KB

fm.png

72.9 KB

cfuser.png

111.8 KB

content.png

127.8 KB

amazon.png

327.6 KB

lastfm.png

356.0 KB

lamoda.png

418.2 KB

/7. kMeans, EM/

ДЗ.txt

0.1 KB

] edit.png

7.1 KB

homework-clustering.ipynb

124.0 KB

lecture_07_clustering.pdf

382.4 KB

lecture_07_clustering.ipynb

1.1 MB

BigData-2018-03 2018 05 03 19 59 53.mp4

251.0 MB

/13. Деревья решений/

ДЗ.txt

0.1 KB

chat.txt

9.0 KB

homework.ipynb

9.1 KB

lecture_13_trees.ipynb

267.9 KB

BigData 2018 03 2018 05 22.mp4

884.7 MB

/31. Spark/stackoverflow/

build.sbt

0.1 KB

/scala_project/

build.sbt

0.1 KB

SparkWordCount.scala

1.0 KB

/data_gathering/data_gathering/parsers/__pycache__/

__init__.cpython-34.pyc

0.2 KB

filter_parser.cpython-34.pyc

1.0 KB

test_parsers.cpython-34.pyc

1.1 KB

parser.cpython-34.pyc

1.2 KB

/data_gathering/data_gathering/storages/__pycache__/

__init__.cpython-34.pyc

0.2 KB

storage.cpython-34.pyc

0.8 KB

file_storage.cpython-34.pyc

1.5 KB

/data_gathering/data_gathering/scrappers/__pycache__/

__init__.cpython-34.pyc

0.2 KB

scrapper.cpython-34.pyc

1.0 KB

vk_api.cpython-34.pyc

2.3 KB

/5. Логистическая регрессия/

ДЗ.txt

0.2 KB

exercises.ipynb

9.6 KB

homework.ipynb

79.9 KB

05_log_regression.ipynb

550.3 KB

lecture_05_logreg.pdf

2.0 MB

BigData-2018-03 2018 04 24 20 01 11.mp4

264.4 MB

/.../lecture_28_pipelines/docker-airflow/.git/hooks/

post-update.sample

0.2 KB

pre-applypatch.sample

0.4 KB

applypatch-msg.sample

0.5 KB

commit-msg.sample

0.9 KB

prepare-commit-msg.sample

1.2 KB

pre-push.sample

1.4 KB

pre-commit.sample

1.6 KB

update.sample

3.6 KB

pre-rebase.sample

4.9 KB

/.../lecture_28_pipelines/docker-airflow/.git/logs/

HEAD

0.2 KB

/.../lecture_28_pipelines/docker-airflow/.git/logs/refs/heads/

master

0.2 KB

/.../lecture_28_pipelines/docker-airflow/.git/logs/refs/remotes/origin/

HEAD

0.2 KB

/17. Анализ текстовых данных/

ДЗ.txt

0.2 KB

otus.png

17.7 KB

lecture_17_text_word2vec.ipynb

59.2 KB

lecture_17_text.ipynb

60.1 KB

spam.csv

503.7 KB

BigData-2018-03 2018 06 07 20 01 01.mp4

267.6 MB

/1. Базовые инструменты анализа данных в Python/

ДЗ.txt

0.2 KB

exercises.ipynb

15.0 KB

Roadmap.pdf

29.7 KB

homework_description.pdf

31.0 KB

env.pdf

116.5 KB

lecture-1-intro.ipynb

156.8 KB

lecture_01_intro.pdf

910.5 KB

lesson1.mp4

257.7 MB

/homework/vkstatsbot/.idea/

misc.xml

0.2 KB

modules.xml

0.3 KB

vkstatsbot.iml

0.4 KB

workspace.xml

9.5 KB

/data_gathering/data_gathering/.idea/

misc.xml

0.2 KB

modules.xml

0.3 KB

data_gathering.iml

0.4 KB

workspace.xml

37.3 KB

/.../lecture_28_pipelines/docker-airflow/.git/info/

exclude

0.2 KB

/.../lecture_27_mr/ClickStream/

run.sh

0.3 KB

ClickStream.java

2.2 KB

/.../lecture_27_mr/ClickStreamTool/

run.sh

0.5 KB

ClickStreamTool.java

2.5 KB

/3. Визуализация/

populations.txt

0.5 KB

crimeRatesByState2005.tsv

2.9 KB

nba.csv

5.0 KB

cars.csv

10.7 KB

lecture_03_vis.pdf

625.3 KB

flights.csv

2.4 MB

3_Data_Visualisation_in_Python.ipynb

7.0 MB

BigData-2018-03 2018 04 10 20 00 15.mp4

258.4 MB

/23. Нейронные сети, часть 2/

ДЗ.txt

0.6 KB

neuron.png

5.4 KB

chat.txt

6.9 KB

AutoEncoder.png

8.4 KB

otus.png

17.7 KB

lecture_23_nn_pytorch.ipynb

33.0 KB

net.jpeg

68.9 KB

graph.png

160.2 KB

DL.pdf

677.1 KB

BigData 2018 07 03 20 00.mp4

664.8 MB

/29. Слои данных для оптимизации процессов использования данных. Hive/

wiki_part.hql

0.6 KB

clickstream.sql

0.7 KB

wiki_part_orc.hql

1.2 KB

homework.pdf

35.0 KB

smallwikipedia.csv

55.4 KB

lecture_29_hive.pdf

702.5 KB

BigData-2018-03 2018 07 24 20 02 05.mp4

682.4 MB

/.../lecture_28_pipelines/cron/

daily.sh

0.7 KB

/22. Нейронные сети, часть 1/

ДЗ.txt

1.1 KB

neuron.png

5.4 KB

chat.txt

6.4 KB

pytorch_tutorial.ipynb

13.9 KB

otus.png

17.7 KB

g2.png

77.9 KB

act.png

78.0 KB

g3.png

79.8 KB

g1.png

94.7 KB

graph.png

160.2 KB

lecture_22_nn_pytorch.ipynb

404.3 KB

backpropagation.pdf

1.2 MB

BigData-2018-03 2018 06 28 20 00 00.mp4

667.5 MB

/.../lecture_28_pipelines/docker-airflow/dags/__pycache__/

tuto.cpython-36.pyc

1.1 KB

/.../lecture_28_pipelines/docker-airflow/dags/

tuto.py

1.3 KB

/20. Временные ряды/

AirPassengers.csv

1.7 KB

chat.txt

5.2 KB

cv.png

6.2 KB

ts1.png

8.1 KB

ts3.png

8.2 KB

ts2.png

9.2 KB

otus.png

17.7 KB

Sales_Transactions_Dataset_Weekly.csv

317.4 KB

lecture_20_ts.ipynb

775.6 KB

otus_items.txt

83.8 MB

BigData-2018-03-2018 06 21 20 00 00.mp4

622.6 MB

/2. Вводная в математические операции/

] vectors.png

1.8 KB

uniform_dist.png

9.9 KB

[corr.png

10.4 KB

uniform_f.png

11.0 KB

otus.png

17.7 KB

norm_f.png

24.2 KB

norm_dist.png

25.2 KB

limit.png

34.0 KB

corr2.png

53.5 KB

lecture_02_math.ipynb

65.6 KB

p_x.png

167.4 KB

lecture_02_math.pdf

486.0 KB

Correlation_examples2.svg

2.3 MB

BigData-2018-03 2018 04 05 20 00 46.mp4

236.9 MB

/.../lecture_28_pipelines/oozie/

example.xml

2.1 KB

/28. Пайплайны. Способы выстроить поток задач, обеспечить выполнение. Отказоустойчивость, мониторинг/

chat.txt

2.6 KB

lecture_28_pipelines.pdf

2.2 MB

BidData-2018-03 - 2018 07 19.mp4

610.3 MB

/.../lecture_28_pipelines/docker-airflow/script/

entrypoint.sh

2.7 KB

/24. Алгоритмы на графах/

125px_Undirected.png

2.8 KB

125px_Directed.png

2.9 KB

203px_Unconnected_graph.png

6.1 KB

photo_2018_02_06_15_57_29.jpg

13.8 KB

otus.png

17.7 KB

585px_VR_complex.svg

22.4 KB

photo_2018_02_06_01_28_08.jpg

49.3 KB

photo_2018_02_06_19_27_02.jpg

59.3 KB

image016.jpg

61.6 KB

6_centrality_measures.png

166.6 KB

animation_d5.gif

1.5 MB

lecture24_networks.ipynb

9.2 MB

BigData-2018-03-2018.mp4

521.6 MB

/9. Feature engineering/

sample_submission.csv

2.8 KB

gender_submission.csv

3.3 KB

otus.png

17.7 KB

test.csv

28.6 KB

Features Homework.pdf

53.4 KB

train.csv

61.2 KB

grad.png

123.6 KB

lecture_09_features.ipynb

496.1 KB

BigData-2018-03 2018 05 10 20 00 26.mp4

229.4 MB

/19. Рекомендательные системы/

homework.txt

3.3 KB

lecture_19_rec.ipynb

37.9 KB

lecture_19_rec_p2.ipynb

55.4 KB

BigData-2018-03 2018 06 19 20 01 15.mp4

225.4 MB

/homework/

homework_description.txt

3.6 KB

/pics/

Stogra.png

3.9 KB

var.png

8.9 KB

sgd_plot.pdf

10.4 KB

otus.png

17.7 KB

g2.png

25.7 KB

g3.png

30.8 KB

g1.png

37.4 KB

ada.png

42.0 KB

nesterov.png

45.2 KB

nesterov2.png

63.9 KB

grad.png

231.2 KB

saddle_point_evaluation_optimizers.gif

731.2 KB

contours_evaluation_optimizers.gif

914.6 KB

/31. Spark/stackoverflow/src/main/scala/

StackOverflowDataset.scala

4.2 KB

/pics/

DS1.png

4.3 KB

DS2.png

4.3 KB

DS3.png

4.4 KB

otus.png

17.7 KB

DT1.png

17.8 KB

DT6.png

18.2 KB

DT5.png

18.2 KB

DT3.png

18.7 KB

DT4.png

18.7 KB

DT2.png

35.0 KB

housing.png

71.7 KB

creditdecisiontree.png

73.6 KB

golf.png

77.6 KB

obama.jpg

119.2 KB

/.../lecture_16_svm/

margin.png

4.9 KB

otus.png

17.7 KB

regression.png

18.2 KB

linear.png

18.2 KB

slack.png

20.1 KB

rocauc.png

40.2 KB

svm_equations.png

72.0 KB

svm_plots.png

176.0 KB

/3. Визуализация/pics/

heatmap.png

7.3 KB

box.png

11.2 KB

otus.png

17.7 KB

bar.png

22.8 KB

plot.png

23.8 KB

pair.png

106.3 KB

pie.png

108.2 KB

salaries.png

175.1 KB

scatter.png

295.7 KB

/6. KNN, наивный байес/

chat.txt

7.7 KB

lecture_06_knn.pdf

460.8 KB

l6_knn_ex.ipynb

1.3 MB

l6_knn.ipynb

1.3 MB

BigData-2018-03 2018 04 26 20 04 06.mp4

236.5 MB

/21. Latent Dirichlet Allocation/

AB_Testing_Normal_Curve.jpg

8.7 KB

ab_process.png

13.0 KB

h1.png

15.3 KB

otus.png

17.7 KB

BabyAge_Control.jpg

24.9 KB

BabyAge_Variation.jpg

26.4 KB

Hypothesis_Testing.jpg

36.2 KB

button_ab_test.png

38.9 KB

band.png

48.7 KB

ab_test_3_kak_provoditsya_ab_testirovanie.jpg

54.2 KB

lecture_21_ab.ipynb

61.0 KB

BigData-2018-03 06 26.mp4

404.8 MB

/data_gathering/data_gathering/__pycache__/

gathering.cpython-34.pyc

9.1 KB

/3. Визуализация/homework/

description.docx

9.8 KB

/5. Логистическая регрессия/pics/

logistic_function_plot.pdf

11.8 KB

sgd_plot.pdf

12.0 KB

step.pdf

13.6 KB

underfitting_learning_curves_plot.pdf

13.8 KB

classification_random_line.pdf

14.7 KB

classification.pdf

14.9 KB

logloss.pdf

15.1 KB

error_function.pdf

15.3 KB

regression_poly.pdf

15.8 KB

regression.pdf

15.8 KB

regression_poly_predicted.pdf

17.2 KB

descision.pdf

17.3 KB

regression_poly_overfit.pdf

17.3 KB

otus.png

17.7 KB

regression_random_line.pdf

17.8 KB

zero_one_loss.pdf

18.5 KB

classification_error.pdf

18.5 KB

regression_estimated.pdf

19.1 KB

regression_random_line_mse.pdf

19.6 KB

supervised.pdf

21.4 KB

supervised.png

29.1 KB

circles.pdf

31.2 KB

circles0050001.pdf

34.8 KB

circles020001.pdf

35.7 KB

circles005001.pdf

37.7 KB

circles021.pdf

38.0 KB

circles00501.pdf

38.1 KB

circles02001.pdf

38.2 KB

circles0201.pdf

38.3 KB

circles0210000.pdf

38.6 KB

circles0051.pdf

38.6 KB

regression_3d.pdf

52.7 KB

regression_3d_estimated.pdf

54.8 KB

unsupervised.pdf

107.0 KB

grad.png

231.2 KB

irissm.pdf

492.2 KB

iris.pdf

492.3 KB

/.../lecture_28_pipelines/docker-airflow/config/

airflow.cfg

12.9 KB

/pics/

m1.png

13.3 KB

m2.png

14.3 KB

blobs_new.pdf

15.3 KB

blobs_nearest1.pdf

15.5 KB

blobs.pdf

15.5 KB

regression_poly_overfit.pdf

15.7 KB

of.png

16.0 KB

m3.png

16.1 KB

otus.png

17.7 KB

lr_cls.png

55.7 KB

knn_cls.png

61.4 KB

knn_cls_1.png

72.3 KB

vtt.png

73.7 KB

cv.png

138.9 KB

/pics/

regression_poly_overfit.pdf

13.7 KB

underfitting_learning_curves_plot.pdf

13.8 KB

regression_poly.pdf

15.7 KB

regression.pdf

15.8 KB

regression_poly_predicted.pdf

16.9 KB

regression_random_line.pdf

17.5 KB

otus.png

17.7 KB

regression_estimated.pdf

18.3 KB

regression_random_line_mse.pdf

18.9 KB

supervised.pdf

20.4 KB

supervised.png

29.1 KB

regression_3d.pdf

40.7 KB

regression_3d_estimated.pdf

41.9 KB

unsupervised.pdf

107.5 KB

/4. Линейная регрессия/

meeting_saved_chat.txt

15.3 KB

] exercises.ipynb

32.9 KB

exercises_key.ipynb

75.0 KB

04_linear_regression.ipynb

520.5 KB

0_Информация.pdf

552.0 KB

lecture_04_linreg.pdf

673.6 KB

BigData-2018-03 2018 04 19 20 00.mp4

570.1 MB

/26. Vowpal Wabbit для обучения линейных моделей на одной машине/

otus.png

17.7 KB

lecture_26.ipynb

256.8 KB

lecture_26_vw.pdf

373.4 KB

BigData-2018-03 2018 07 12 20 05 13.mp4

630.5 MB

/14. Ансамбли моделей/

otus.png

17.7 KB

lecture_14_ens.ipynb

490.3 KB

BigData-2018-03 2018 05 24 20 00 00.mp4

262.9 MB

/pics/

otus.png

17.7 KB

latent-dirichlet-allocation-7-1024.jpg

54.3 KB

kmeans.png

84.4 KB

gauss.png

112.2 KB

LogDirichletDensity-alpha_0.3_to_alpha_2.0.gif

3.5 MB

/16. SVM, Support vector machine/

linear.png

18.2 KB

lecture_16_svm.ipynb

442.1 KB

BigData-2018-03 2018 05 31 20 01 15.mp4

242.6 MB

/.../lecture_28_pipelines/docker-airflow/.git/objects/pack/

pack-350773030e4626b979dd0965444cfb4dc5defb79.idx

19.0 KB

pack-350773030e4626b979dd0965444cfb4dc5defb79.pack

159.5 KB

/32. Обзор решений для аналитики больших данных/

Проект.pdf

25.5 KB

lecture_32_schemas.pdf

1.7 MB

BigData-2018-03 2018 08 02 20 02 42.mp4

571.5 MB

/27. MapReduce на Java, Hadoop Streaming - MapReduce на Python, bash/

homework.pdf

28.4 KB

lecture_27_mapred.pdf

396.0 KB

BigData-2018-03 2018 07 17 part 1.mp4

125.2 MB

BigData-2018-03 2018 07 17 part 2.mp4

419.4 MB

/8. Иерархическая кластеризация, DB-Scan/

clusters

35.0 KB

lecture_08_clustering.ipynb

1.1 MB

data.csv

45.6 MB

BigData-2018-03 2018 05 08 20 00 09.mp4

238.5 MB

/12. Методы оптимизации/

lecture_12_opt.ipynb

325.7 KB

BigData-2018-03 2018 06 05 20 01 07.mp4

192.2 MB

/10. Поиск выбросов в данных/

lecture_10_outliers.ipynb

502.6 KB

BigData-2018-15 2018 05 15 20 01 34.mp4

963.8 MB

 
