동아대학교 AI학과 > Papers > AI Materials

AI Studio 열린 미래 꿈이있는 대학, 동아대학교

AI Studio

AI Studio
- 세미나
- AI Materials

AI Materials

Knowledge Graphs

연구분야 소개: 지식그래프(Knowledge graphs: KG)는 현실세계의 개체간의 구조적인 관계를 표현하는 데이터 모델입니다. KG는 새롭고 유망한 지식의 표현의 방법으로서 수많은 AI 기반 첨단 응용에서 활용되고 있습니다. 첨단 응용에서 KG의 사용은 Recommendation, Question Answering(QA), Classification 등의 연구 분야에서 상당한 효과가 있음이 증명되었습니다. 이 페이지는 KG에 관심이 있는 학부생부터 석박사 과정까지 연구방향을 수립을 위한 논문들의 리스트를 제공합니다.

KG 데이터 모델의 표현 방법
- Liang, K., Meng, L., Liu, M., Liu, Y., Tu, W., Wang, S., ... & Sun, F. (2022). A Survey of Knowledge Graph Reasoning on Graph Types: Static, Dynamic, and Multimodal.
- Ji, S., Pan, S., Cambria, E., Marttinen, P., & Philip, S. Y. (2021). A survey on knowledge graphs: Representation, acquisition, and applications. IEEE transactions on neural networks and learning systems, 33(2), 494-514.

그래프 기반 추론 (Graph-based Deduction)
- Dell’Aglio, D., Della Valle, E., van Harmelen, F., & Bernstein, A. (2017). Stream reasoning: A survey and outlook. Data Science, 1(1-2), 59-83.
- Wang, D., Hu, P., Wałęga, P. A., & Grau, B. C. (2022, June). Meteor: Practical reasoning in datalog with metric temporal operators. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 36, No. 5, pp. 5906-5913).

뉴럴-심볼릭 추론 (Neural-symbolic reasoning)
- Ebrahimi, M., Sarker, M. K., Bianchi, F., Xie, N., Eberhart, A., Doran, D., ... & Hitzler, P. (2021, March). Neuro-Symbolic Deductive Reasoning for Cross-Knowledge Graph Entailment. In AAAI Spring Symposium: Combining Machine Learning with Knowledge Engineering.

튜토리얼 및 epub
- https://distill.pub/2021/gnn-intro/

Images

연구분야 소개: Image 연구 분야는 디지털 이미지의 처리, 분석, 그리고 이해에 중점을 두는 기술 분야입니다. 이 분야는 컴퓨터 비전을 활용하여 객체 인식, 이미지 분류, 패턴 인식 등의 기능을 개발하며, 이를 통해 이미지의 시각적 정보를 컴퓨터가 해석하도록 합니다. 이미지 연구는 의료 영상 분석, 안면 인식, 자동차의 자율 주행 시스템 등 다양한 응용 분야에서 중요한 역할을 합니다.

Noroozi, M., & Favaro, P. (2016, September). Unsupervised learning of visual representations by solving jigsaw puzzles. In European conference on computer vision (pp. 69-84). Cham: Springer International Publishing.
Radford, A., Metz, L., & Chintala, S. (2015). Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434.
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).
Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., & Aila, T. (2020). Analyzing and improving the image quality of stylegan. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 8110-8119).
Luddecke, T., & Ecker, A. (2022). Image segmentation using text and image prompts. In 2022 IEEE. In CVF Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 7076-7086).
Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.
Isola, P., Zhu, J. Y., Zhou, T., & Efros, A. A. (2017). Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1125-1134).

NLP

연구분야 소개: 자연어 처리(NLP)는 컴퓨터가 인간의 언어를 이해하고 해석하는 데 초점을 맞춘 인공지능(AI)의 한 분야입니다. 이 분야는 텍스트와 음성 데이터를 분석하여 언어의 구조와 의미를 파악하는 것을 목표로 합니다. NLP는 기계 번역, 감정 분석, 챗봇과 같은 대화 시스템 개발 등 다양한 응용 분야에서 활용됩니다.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30.
Kenton, J. D. M. W. C., & Toutanova, L. K. (2019, June). Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of naacL-HLT (Vol. 1, p. 2).
Radford, A., Narasimhan, K., Salimans, T., & Sutskever, I. (2018). Improving language understanding by generative pre-training.
Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep contextualized word representations. In Proceedings of NAACL-HLT (pp. 2227-2237).
Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. Advances in neural information processing systems, 27.
Lewis, M., Liu, Y., Goyal, N., Ghazvininejad, M., Mohamed, A., Levy, O., ... & Zettlemoyer, L. (2019). Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. arXiv preprint arXiv:1910.13461.

Audio

연구분야 소개: Audio 연구 분야는 소리와 음향 데이터를 처리하고 분석하는 기술을 개발하는 것에 중점을 둡니다. 이 분야는 음성 인식, 음악 생성, 음향 효과 개선 등 다양한 측면을 포함하며, 컴퓨터가 사람의 귀처럼 소리를 인식하고 해석하는 방법을 연구합니다. 오디오 연구는 인공지능, 신호 처리, 기계 학습 등 여러 과학 기술 분야와 밀접하게 연결되어 있습니다.

Hershey, S., Chaudhuri, S., Ellis, D. P., Gemmeke, J. F., Jansen, A., Moore, R. C., ..0 Wilson, K. (2017, March). CNN architectures for large-scale audio classification. In 2017 ieee international conference on acoustics, speech and signal processing (icassp) (pp. 131-135). IEEE.
Park, D. S., Chan, W., Zhang, Y., Chiu, C. C., Zoph, B., Cubuk, E. D., & Le, Q. V. (2019). Specaugment: A simple data augmentation method for automatic speech recognition. arXiv preprint arXiv:1904.08779.
Ren, Y., Hu, C., Tan, X., Qin, T., Zhao, S., Zhao, Z., & Liu, T. Y. (2020). Fastspeech 2: Fast and high-quality end-to-end text to speech. arXiv preprint arXiv:2006.04558.
Oord, A. V. D., Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A., ... & Kavukcuoglu, K. (2016). Wavenet: A generative model for raw audio. arXiv preprint arXiv:1609.03499.
Han, W., Zhang, Z., Zhang, Y., Yu, J., Chiu, C. C., Qin, J., ... & Wu, Y. (2020). Contextnet: Improving convolutional neural networks for automatic speech recognition with global context. arXiv preprint arXiv:2005.03191.
Kong, Q., Cao, Y., Iqbal, T., Wang, Y., Wang, W., & Plumbley, M. D. (2020). Panns: Large-scale pretrained audio neural networks for audio pattern recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, 2880-2894.

Video

연구분야 소개: Video 분야는 영상 데이터의 처리, 분석 및 이해에 초점을 맞춘 기술 분야입니다. 이 분야는 컴퓨터 비전을 활용하여 움직임 추적, 객체 인식, 3D 재구성과 같은 기능을 개발하며, 이를 통해 비디오 내용을 자동으로 해석하고 분류합니다. 비디오 연구는 보안 감시, 자율 주행 차량, 가상 현실 등 다양한 응용 분야에 중요한 역할을 합니다.

Tang, Z., Cho, J., Nie, Y., & Bansal, M. (2022). TVLT: Textless vision-language transformer. Advances in Neural Information Processing Systems, 35, 9617-9632.
Voigtlaender, P., Chai, Y., Schroff, F., Adam, H., Leibe, B., & Chen, L. C. (2019). Feelvos: Fast end-to-end embedding learning for video object segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 9481-9490).
Wang, J., Yang, Z., Hu, X., Li, L., Lin, K., Gan, Z., ... & Wang, L. (2022). Git: A generative image-to-text transformer for vision and language. arXiv preprint arXiv:2205.14100.
Liu, T., Sun, J. J., Zhao, L., Zhao, J., Yuan, L., Wang, Y., ... & Adam, H. (2022). View-invariant, occlusion-robust probabilistic embedding for human pose. International Journal of Computer Vision, 130(1), 111-135.
Usman, B., Tagliasacchi, A., Saenko, K., & Sud, A. (2022). Metapose: Fast 3d pose from multiple views without 3d supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 6759-6770).
Li, B., Wu, W., Wang, Q., Zhang, F., Xing, J., & Yan, J. (2019). Siamrpn++: Evolution of siamese visual tracking with very deep networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 4282-4291).
Mehta, D., Sotnychenko, O., Mueller, F., Xu, W., Elgharib, M., Fua, P., ... & Theobalt, C. (2020). XNect: Real-time multi-person 3D motion capture with a single RGB camera. Acm Transactions On Graphics (TOG), 39(4), 82-1.
Yuan, L., Qian, R., Cui, Y., Gong, B., Schroff, F., Yang, M. H., ... & Liu, T. (2022). Contextualized spatio-temporal contrastive learning with self-supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 13977-13986).
Gu, C., Sun, C., Ross, D. A., Vondrick, C., Pantofaru, C., Li, Y., ... & Malik, J. (2018). Ava: A video dataset of spatio-temporally localized atomic visual actions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6047-6056).

Reinforcement Learning

연구분야 소개: Reinforcement Learning 분야는 에이전트가 시행착오를 통해 최적의 결정을 내리는 방법을 학습하는 인공지능의 한 분야입니다. 이 분야는 에이전트가 주어진 환경에서 행동을 취하고, 그 결과로 받는 보상을 기반으로 학습하는 알고리즘을 연구합니다. 강화 학습은 게임, 로봇 공학, 자율 주행 차량 등 다양한 분야에 적용되며, 복잡한 문제를 해결하는 데 중요한 역할을 합니다.

Lillicrap, T. P., Hunt, J. J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., ... & Wierstra, D. (2015). Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971.
Haarnoja, T., Zhou, A., Abbeel, P. & Levine, S.. (2018). Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor. Proceedings of the 35th International Conference on Machine Learning, in Proceedings of Machine Learning Research 80:1861-1870 Available from
Haarnoja, T., Moran, B., Lever, G., Huang, S. H., Tirumala, D., Wulfmeier, M., ... & Heess, N. (2023). Learning agile soccer skills for a bipedal robot with deep reinforcement learning. arXiv preprint arXiv:2304.13653.
Plappert, M., Andrychowicz, M., Ray, A., McGrew, B., Baker, B., Powell, G., ... & Zaremba, W. (2018). Multi-goal reinforcement learning: Challenging robotics environments and request for research. arXiv preprint arXiv:1802.09464.
Mania, H., Guy, A., & Recht, B. (2018). Simple random search provides a competitive approach to reinforcement learning. arXiv preprint arXiv:1803.07055.
Lanctot, M., Lockhart, E., Lespiau, J. B., Zambaldi, V., Upadhyay, S., Perolat, J., ... & Ryan-Davis, J. (2019). OpenSpiel: A framework for reinforcement learning in games. arXiv preprint arXiv:1908.09453.
Zambaldi, V., Raposo, D., Santoro, A., Bapst, V., Li, Y., Babuschkin, I., ... & Battaglia, P. (2018). Relational deep reinforcement learning. arXiv preprint arXiv:1806.01830.

스킵네비게이션

AI학과

전체메뉴

AI Materials