Biography:Ashish Vaswani
Ashish Vaswani is a computer scientist working in deep learning,[1] who is known for his significant contributions to the field of artificial intelligence (AI) and natural language processing (NLP). He is one of the co-authors of the seminal paper "Attention Is All You Need"[2] which introduced the Transformer model, a novel architecture that uses a self-attention mechanism and has since become foundational to many state-of-the-art models in NLP. Transformer architecture is the core of language models that power applications such as ChatGPT.[3][4][5] He was a co-founder of Adept AI Labs[6][7] and a former staff research scientist at Google Brain.[8][9]
Career
Vaswani completed his engineering in Computer Science from BIT Mesra in 2002. In 2004, he moved to the US to pursue higher studies at University of Southern California.[10] He did his PhD at the University of Southern California.[11] He has worked as a researcher at Google,[12] where he was part of the Google Brain team. He was a co-founder of Adept AI Labs but left the company.[13][14]
Notable works
Vaswani's most notable work is the paper "Attention Is All You Need", published in 2017.[15] The paper introduced the Transformer model, which eschews the use of recurrence in sequence-to-sequence tasks and relies entirely on self-attention mechanisms. The model has been instrumental in the development of several subsequent state-of-the-art models in NLP, including BERT,[16] GPT-2, and GPT-3.
References
- ↑ "Ashish Vaswani". https://scholar.google.com/citations?user=oR9sCGYAAAAJ&hl=en.
- ↑ Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, Łukasz; Polosukhin, Illia (2017). "Attention is All you Need". Advances in Neural Information Processing Systems (Curran Associates, Inc.) 30. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf.
- ↑ "Inside the brain of ChatGPT" (in en). https://stackbuilders.com/blog/inside-the-brain-of-chatgpt/.
- ↑ "Understanding ChatGPT as explained by ChatGPT" (in en-US). 2023-01-18. https://www.advancinganalytics.co.uk/blog/2023/1/18/language-models-what-is-chatgpt.
- ↑ Seetharaman, Deepa; Jin, Berber (2023-05-08). "ChatGPT Fever Has Investors Pouring Billions Into AI Startups, No Business Plan Required" (in en-US). Wall Street Journal. ISSN 0099-9660. https://www.wsj.com/articles/no-business-plan-no-problem-chatgpt-spawns-an-investor-gold-rush-in-ai-6bdbed3c.
- ↑ "Introducing Adept". https://www.adept.ai/blog/introducing-adept.
- ↑ "Top ex-Google AI researchers raise $8 million in funding from Thrive Capital". The Economic Times. May 4, 2023. https://economictimes.indiatimes.com/tech/technology/top-ex-google-ai-researchers-raise-funding-from-thrive-capital/articleshow/99990103.cms?from=mdr.
- ↑ Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N.; Kaiser, Lukasz; Polosukhin, Illia (May 21, 2017). "Attention is All You Need". arXiv:1706.03762 [cs.CL].
- ↑ Shead, Sam (2022-06-10). "A.I. gurus are leaving Big Tech to work on buzzy new start-ups" (in en). https://www.cnbc.com/2022/06/10/ai-gurus-are-leaving-big-tech-to-work-on-buzzy-new-start-ups.html.
- ↑ Team, OfficeChai (February 4, 2023). "The Indian Researchers Whose Work Led To The Creation Of ChatGPT". https://officechai.com/stories/the-indian-researchers-whose-work-led-to-the-creation-of-chatgpt/.
- ↑ "Ashish Vaswani's webpage at ISI". https://www.isi.edu/~avaswani/.
- ↑ "Transformer: A Novel Neural Network Architecture for Language Understanding". August 31, 2017. https://ai.googleblog.com/2017/08/transformer-novel-neural-network.html.
- ↑ Rajesh, Ananya Mariam; Hu, Krystal; Rajesh, Ananya Mariam; Hu, Krystal (March 16, 2023). "AI startup Adept raises $350 mln in fresh funding". Reuters. https://www.reuters.com/technology/adept-raises-350-mln-series-b-funding-2023-03-14/.
- ↑ Tong, Anna; Hu, Krystal; Tong, Anna; Hu, Krystal (2023-05-04). "Top ex-Google AI researchers raise funding from Thrive Capital" (in en). Reuters. https://www.reuters.com/technology/top-ex-google-ai-researchers-raise-funding-thrive-capital-sources-2023-05-04/.
- ↑ "USC Alumni Paved Path for ChatGPT". https://viterbischool.usc.edu/news/2023/03/attention-is-all-you-need-usc-alumni-paved-path-for-chatgpt/.
- ↑ Devlin, Jacob; Chang, Ming-Wei; Lee, Kenton; Toutanova, Kristina (May 24, 2019). "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding". arXiv:1810.04805 [cs.CL].
Original source: https://en.wikipedia.org/wiki/Ashish Vaswani.
Read more |