Publications

 


1. Journal Papers

  1. Jun-Ichi Iwata, Daisuke Takahashi, Atsushi Oshiyama, Taisuke Boku, Kenji Shiraishi, Susumu Okada and Kazuhiro Yabana: A massively-parallel electronic-structure calculations based on real-space density functional theory, Journal of Computational Physics, Vol. 229, No. 6, pp. 2339--2363 (2010).
  2. Tetsuya Sakurai, Yoshihisa Kodaki, Hiroto Tadano, Daisuke Takahashi, Mitsuhisa Sato and Umpei Nagashima: A parallel method for large sparse generalized eigenvalue problems using a GridRPC system, Future Generation Computer Systems, Vol. 24, No. 6, pp. 613--619 (2008).
  3. Taisuke Boku, Hajime Susa, Kenji Onuma, Masayuki Umemura, Mitsuhisa Sato and Daisuke Takahashi: Formation of Dwarf Galaxies in Reionized Universe with Heterogeneous Multicomputer System, International Journal for Multiscale Computational Engineering, Vol. 4, No. 2, pp. 281--289 (2006).
  4. Daisuke Takahashi: An algorithm for multiple-precision floating-point multiplication, Applied Mathematics and Computation, Vol. 166, No. 2, pp. 291--298 (2005).
  5. Daisuke Takahashi: A parallel 1-D FFT algorithm for the Hitachi SR8000, Parallel Computing, Vol. 29, No. 6, pp. 679--690 (2003).
  6. Daisuke Takahashi, Mitsuhisa Sato and Taisuke Boku: Performance Evaluation of the Hitachi SR8000 Using SPEC OMP2001 Benchmarks, International Journal of Parallel Programming, Vol. 31, No. 3, pp. 185--196 (2003).
  7. Daisuke Takahashi: Efficient implementation of parallel three-dimensional FFT on clusters of PCs, Computer Physics Communications, Vol. 152, No. 2, pp. 144--150 (2003).
  8. Daisuke Takahashi: An Extended Split-Radix FFT Algorithm, IEEE Signal Processing Letters, Vol. 8, No. 5, pp. 145--147 (2001).
  9. Daisuke Takahashi: A fast algorithm for computing large Fibonacci numbers, Information Processing Letters, Vol. 75, No. 6, pp. 243--246 (2000).
  10. Daisuke Takahashi and Yasumasa Kanada: High-Performance Radix-2, 3 and 5 Parallel 1-D Complex FFT Algorithms for Distributed-Memory Parallel Computers, The Journal of Supercomputing, Vol. 15, No. 2, pp. 207--228 (2000).

2. Conference Proceedings

  1. Daisuke Takahashi: A Parallel Algorithm for Multiple-Precision Division by a Single-Precision Integer, Proc. 6th International Conference on Large-Scale Scientific Computations (LSSC 2007), Lecture Notes in Computer Science, No. 4818, pp. 729--736, Springer-Verlag (2008).
  2. Chikafumi Takahashi, Mitsuhisa Sato, Daisuke Takahashi, Taisuke Boku, Hiroshi Nakamura, Masaaki Kondo and Motonobu Fujita: Empirical Study for Optimization of Power-Performance with On-Chip Memory, Proc. First International Workshop on Advanced Low Power Systems (ALPS 2006), Lecture Notes in Computer Science, No. 4759, pp. 466--479, Springer-Verlag (2008).
  3. Daisuke Takahashi: Implementation and Evaluation of Parallel FFT Using SIMD Instructions on Multi-Core Processors, Proc. 10th International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA 2007), pp. 53--59 (2008).
  4. Daisuke Takahashi: An Implementation of Parallel 1-D FFT Using SSE3 Instructions on Dual-Core Processors, Proc. Workshop on State-of-the-Art in Scientific and Parallel Computing (PARA 2006), Lecture Notes in Computer Science, No. 4699, pp. 1178--1187, Springer-Verlag (2007).
  5. Akira Nukada, Daisuke Takahashi, Reiji Suda and Akira Nishida: High Performance FFT on SGI Altix 3700, Proc. 3rd International Conference on High Performance Computing and Communications (HPCC 2007), Lecture Notes in Computer Science, No. 4782, pp. 396--407, Springer-Verlag (2007).
  6. Takayuki Imada, Mitsuhisa Sato, Yoshihiko Hotta, Hideaki Kimura, Taisuke Boku and Daisuke Takahashi: Power-performance Evaluation on Ultra-Low Power High-performance Cluster System: MegaProto/E, Proc. IEEE Symposium on Low-Power and High-Speed Chips (COOL Chips X), pp. 117--129 (2007).
  7. Takayuki Okamoto, Shinichi Miura, Taisuke Boku, Mitsuhisa Sato and Daisuke Takahashi: RI2N/UDP: High bandwidth and fault-tolerant network for PC-cluster based on multi-link Ethernet, Proc. 2007 IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), pp. 287 (2007).
  8. Hideaki Kimura, Mitsuhisa Sato, Yoshihiko Hotta, Taisuke Boku and Daisuke Takahashi: Empirical Study on Reducing Energy of Parallel Programs using Slack Reclamation by DVFS, Proc. 2006 IEEE International Conference on Cluster Computing (Cluster 2006), pp. 1--10 (2006).
  9. Takuya Yokozawa, Daisuke Takahashi, Taisuke Boku and Mitsuhisa Sato: Efficient Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix Multiplication, Proc. 4th International Workshop on Parallel Matrix Algorithms and Applications (PMAA'06), pp. 37--38 (2006).
  10. Taisuke Boku, Mitsuhisa Sato, Akira Ukawa, Daisuke Takahashi, Shinji Sumimoto, Kouichi Kumon, Takashi Moriyama and Masaaki Shimizu: PACS-CS: A large-scale bandwidth-aware PC cluster for scientific computations, Proc. Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06), pp. 233--240 (2006).
  11. Daisuke Takahashi: A Hybrid MPI/OpenMP Implementation of a Parallel 3-D FFT on SMP Clusters, Proc. 6th International Conference on Parallel Processing and Applied Mathematics (PPAM 2005), Lecture Notes in Computer Science, No. 3911, pp. 970--977, Springer-Verlag (2006).
  12. Yoshiaki Aida, Yoshihiro Nakajima, Mitsuhisa Sato, Tetsuya Sakurai, Daisuke Takahashi and Taisuke Boku: Performance Improvement by Data Management Layer in a Grid RPC System, Proc. First International Conference on Grid and Pervasive Computing (GPC 2006), Lecture Notes in Computer Science, No. 3947, pp. 324--335, Springer-Verlag (2006).
  13. Taisuke Boku, Mitsuhisa Sato, Daisuke Takahashi, Hiroshi Nakashima, Hiroshi Nakamura, Satoshi Matsuoka and Yoshihiko Hotta: MegaProto/E: Power-Aware High-Performance Cluster with Commodity Technology, Proc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), pp. 346 (2006).
  14. Yoshihiko Hotta, Mitsuhisa Sato, Hideaki Kimura, Satoshi Matsuoka, Taisuke Boku and Daisuke Takahashi: Profile-based Optimization of Power Performance by using Dynamic Voltage Scaling on a PC cluster, Proc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2006), pp. 340 (2006).
  15. Daisuke Takahashi, Taisuke Boku and Mitsuhisa Sato: An Implementation of Parallel 3-D FFT Using Short Vector SIMD Instructions on Clusters of PCs, Proc. 7th International Workshop on Applied Parallel Computing (PARA 2004), Lecture Notes in Computer Science, No. 3732, pp. 1159--1167, Springer-Verlag (2006).
  16. Tetsuya Sakurai, Kentaro Hayakawa, Mitsuhisa Sato and Daisuke Takahashi: A Parallel Method for Large Sparse Generalized Eigenvalue Problems by OmniRPC in a Grid Environment, Proc. 7th International Workshop on Applied Parallel Computing (PARA 2004), Lecture Notes in Computer Science, No. 3732, pp. 1151--1158, Springer-Verlag (2006).
  17. Daisuke Takahashi, Mitsuhisa Sato and Taisuke Boku: Computation of High-Precision Mathematical Constants in a Combined Cluster and Grid Environment, Proc. 5th International Conference on Large-Scale Scientific Computations (LSSC 2005), Lecture Notes in Computer Science, No. 3743, pp. 454--461, Springer-Verlag (2006).
  18. Yoshinori Ojima, Mitsuhisa Sato, Taisuke Boku and Daisuke Takahashi: Design of a Software Distributed Shared Memory System using an MPI communication layer, Proc. 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN 2005), pp. 220--229 (2005).
  19. Shinichi Miura, Takayuki Okamoto, Taisuke Boku, Mitsuhisa Sato and Daisuke Takahashi: Low-cost High-bandwidth Tree Network for PC Clusters based on Tagged-VLAN Technology, Proc. 8th International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN 2005), pp. 84--93 (2005).
  20. Hiroshi Nakashima, Hiroshi Nakamura, Mitsuhisa Sato, Taisuke Boku, Satoshi Matsuoka, Daisuke Takahashi and Yoshihiko Hotta: MegaProto: 1TFlops/10kW Rack Is Feasible Even with Only Commodity Technology, Proc. 2005 ACM/IEEE Conference on Supercomputing (SC|05), pp. 28 (2005).
  21. Mitsuhisa Sato, Yoshihiro Nakajima, Tetsuya Sakurai, Taisuke Boku and Daisuke Takahashi: OmniRPC Grid Parallel Programming Environment for a Large Scale Numerical Computation, Proc. 17th IMACS World Congress Scientific Computation, Applied Mathematics and Simulation (2005).
  22. Hiroshi Nakashima, Hiroshi Nakamura, Mitsuhisa Sato, Taisuke Boku, Satoshi Matsuoka, Daisuke Takahashi and Yoshihiko Hotta: MegaProto: A Low-Power and Compact Cluster for High-Performance Computing, Proc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 11, Vol. 12, pp. 231b (2005).
  23. Taisuke Boku, Kenji Onuma, Mitsuhisa Sato, Yoshihiro Nakajima and Daisuke Takahashi: Grid environment for computational astrophysics driven by GRAPE-6 with HMCS-G and OmniRPC, Proc. 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 4, Vol. 5, pp. 176a (2005).
  24. Yoshinori Ojima, Mitsuhisa Sato, Taisuke Boku and Daisuke Takahashi: Design of Software Distributed Shared Memory System using MPI communication layer, Proc. 4th International Workshop on OpenMP: Experiences and Implementations (WOMPEI 2005), pp. 18--25 (2005).
  25. Taisuke Boku, Mitsuhisa Sato, Masazumi Matsubara and Daisuke Takahashi: OpenMPI --- OpenMP like tool for easy programming in MPI, Proc. 6th European Workshop on OpenMP (EWOMP 2004), pp. 83--88 (2004).
  26. Yoshihiro Nakajima, Mitsuhisa Sato, Hitoshi Goto, Taisuke Boku and Daisuke Takahashi: Implementation and Performance Evaluation of CONFLEX-G: Grid-enabled Molecular Conformational Space Search Program with OmniRPC, Proc. 18th International Conference on Supercomputing (ICS'04), pp. 154--163 (2004).
  27. Chikafumi Takahashi, Masaaki Kondo, Taisuke Boku, Daisuke Takahashi, Hiroshi Nakamura and Mitsuhisa Sato: SCIMA-SMP: on-chip memory processor architecture for SMP, Proc. 3rd Workshop on Memory Performance Issues (WMPI'04), pp. 121--128 (2004).
  28. Taisuke Boku, Hajime Susa, Kenji Onuma, Masayuki Umemura, Mitsuhisa Sato and Daisuke Takahashi: Formation of Dwarf Galaxies in Reionized Universe with Heterogeneous Multi-Computer System, Proc. International Conference on Computational Science 2004 (ICCS 2004), Lecture Notes in Computer Science, No. 3039, pp. 629--636, Springer-Verlag (2004).
  29. Yuhsuke Ohtaki, Daisuke Takahashi, Taisuke Boku and Mitsuhisa Sato: Parallel Implementation of Strassen's Matrix Multiplication Algorithm for Heterogeneous Clusters, Proc. 18th International Parallel and Distributed Processing Symposium (IPDPS'04) - Workshop 1, Vol. 2, pp. 112a (2004).
  30. Yoshihiko Hotta, Mitsuhisa Sato, Taisuke Boku, Daisuke Takahashi and Chikafumi Takahashi: Measurement and Characterization of Power Consumption of Microprocessors for Power-aware Cluster, Proc. An International Symposium on Low-Power and High-Speed Chips (COOL Chips VII), pp. 293--303 (2004).
  31. Yoshihiro Nakajima, Mitsuhisa Sato, Taisuke Boku, Daisuke Takahashi and Hitoshi Gotoh: Performance Evaluation of OmniRPC in a Grid Environment, Proc. 2004 International Symposium on Applications and the Internet Workshops (SAINT 2004 Workshops), pp. 658--664 (2004).
  32. Kenji Onuma, Taisuke Boku, Mitsuhisa Sato, Daisuke Takahashi, Hajime Susa and Masayuki Umemura: Heterogeneous Remote Computing System for Computational Astrophysics with OmniRPC, Proc. 2004 International Symposium on Applications and the Internet Workshops (SAINT 2004 Workshops), pp. 623--629 (2004).
  33. Shinichi Miura, Taisuke Boku, Mitsuhisa Sato and Daisuke Takahashi: RI2N --- Interconnection Network System for Clusters with Wide-Bandwidth and Fault-Tolerancy Based on Multiple Links, Proc. 5th International Symposium on High Performance Computing (ISHPC 2003), Lecture Notes in Computer Science, No. 2858, pp. 342--351, Springer-Verlag (2003).
  34. Daisuke Takahashi: A Radix-16 FFT Algorithm Suitable for Multiply-Add Instruction Based on Goedecker Method, Proc. 2003 IEEE International Conference on Multimedia and Expo (ICME 2003), Vol. 2, pp. 845--848 (2003).
  35. Daisuke Takahashi, Mitsuhisa Sato and Taisuke Boku: An OpenMP Implementation of Parallel FFT and Its Performance on IA-64 Processors, Proc. International Workshop on OpenMP Applications and Tools (WOMPAT 2003), Lecture Notes in Computer Science, No. 2716, pp. 99--108, Springer-Verlag (2003).
  36. Taisuke Boku, Mitsuhisa Sato, Kenji Onuma, Junichiro Makino, Hajime Susa, Daisuke Takahashi, Masayuki Umemura and Akira Ukawa: HMCS-G: Grid-enabled Hybrid Computing System for Computational Astrophysics, Proc. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'03), pp. 558--565 (2003).
  37. Mitsuhisa Sato, Taisuke Boku and Daisuke Takahashi: OmniRPC: a Grid RPC System for Parallel Programming in Cluster and Grid Environment, Proc. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'03), pp. 206--213 (2003).
  38. Shinsuke Nara, Yuichi Goto, Daisuke Takahashi and Jingde Cheng: Parallel Forward Deduction System for General-Purpose Entailment Calculus on Clusters of PCs, Proc. IASTED International Conference on Networks, Parallel and Distributed Processing, and Applications (NPDPA 2002), pp. 359--364 (2002).
  39. Yuichi Goto, Daisuke Takahashi and Jingde Cheng: Improving Performance of Automated Forward Deduction System EnCal on Shared-Memory Parallel Computers, Proc. Third International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2002), pp. 63--68 (2002).
  40. Daisuke Takahashi, Taisuke Boku and Mitsuhisa Sato: A Blocking Algorithm for Parallel 1-D FFT on Clusters of PCs, Proc. 8th International Euro-Par Conference (Euro-Par 2002), Lecture Notes in Computer Science, No. 2400, pp. 691--700, Springer-Verlag (2002).
  41. Daisuke Takahashi: A Blocking Algorithm for Parallel 1-D FFT on Shared-Memory Parallel Computers, Proc. 6th International Conference on Applied Parallel Computing (PARA 2002), Lecture Notes in Computer Science, No. 2367, pp. 380--389, Springer-Verlag (2002).
  42. Daisuke Takahashi, Mitsuhisa Sato and Taisuke Boku: Performance Evaluation of the Hitachi SR8000 Using OpenMP Benchmarks, Proc. 4th International Symposium on High Performance Computing (ISHPC 2002), Lecture Notes in Computer Science, No. 2327, pp. 390--400, Springer-Verlag (2002).
  43. Yuichi Goto, Daisuke Takahashi and Jingde Cheng: Parallel Forward Deduction Algorithms of General-Purpose Entailment Calculus on Shared-Memory Parallel Computers, Proc. 2nd International Conference on Software Engineering, Artificial Intelligence, Networking & Parallel/Distributed Computing (SNPD'01), pp. 168--175 (2001).
  44. Daisuke Takahashi: A Blocking Algorithm for FFT on Cache-Based Processors, Proc. 9th International Conference on High Performance Computing and Networking Europe (HPCN Europe 2001), Lecture Notes in Computer Science, No. 2110, pp. 551--554, Springer-Verlag (2001).
  45. Daisuke Takahashi: A Mixed-Radix Parallel Three-Dimensional FFT Algorithm on Clusters of Vector SMPs, Proc. Tenth SIAM Conference on Parallel Processing for Scientific Computing, (CD-ROM), 10 pages (2001).
  46. Seiji Nishimura, Daisuke Takahashi, Takaomi Shigehara, Hiroshi Mizoguchi and Taketoshi Mishima: A Performance Study on a Single Processing Node of the HITACHI SR8000, Proc. Second International Conference on Numerical Analysis and Its Applications (NAA 2000), Lecture Notes in Computer Science, No. 1988, pp. 628--635, Springer-Verlag (2001).
  47. Daisuke Takahashi: A Parallel 3-D FFT Algorithm on Clusters of Vector SMPs, Proc. 5th International Workshop on Applied Parallel Computing (PARA 2000), Lecture Notes in Computer Science, No. 1947, pp. 316--323, Springer-Verlag (2001).
  48. Daisuke Takahashi: Implementation of Multiple-Precision Parallel Division and Square Root on Distributed-Memory Parallel Computers, Proc. 2000 International Workshop on Parallel Processing (ICPP'00 Workshops), pp. 229--235 (2000).
  49. Seiji Nishimura, Daisuke Takahashi, Takaomi Shigehara, Hiroshi Mizoguchi and Taketoshi Mishima: Efficient Implementation of CG & CR Methods for Linear Systems on a Single Processing Node of HITACHI SR8000, Proc. 2000 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC2000), pp. 298--301 (2000).
  50. Daisuke Takahashi: A New Radix-6 FFT Algorithm Suitable for Multiply-Add Instruction, Proc. 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2000), Vol. 6, pp. 3343--3346 (2000).
  51. Daisuke Takahashi: High-Performance Parallel FFT Algorithms for the HITACHI SR8000, Proc. Fourth International Conference/Exhibition on High Performance Computing in Asia-Pacific Region (HPC-Asia 2000), Vol. 1, pp. 192--199 (2000).
  52. Daisuke Takahashi and Yasumasa Kanada: Fast High-Precision Arithmetic on Distributed Memory Parallel Machines, Proc. Ninth SIAM Conference on Parallel Processing for Scientific Computing, (CD-ROM), 10 pages (1999).

3. Oral Presentations

  1. Daisuke Takahashi: Automatic Tuning for Parallel 3-D FFT with 2-D Decomposition, 2010 SIAM Conference on Parallel Processing for Scientific Computing, Grand Hyatt Seattle, Seattle, Washington, USA, February 25, 2010.
  2. Daisuke Takahashi: A Volumetric 3-D FFT on Clusters of Multi-Core Processors, Third French-Japanese PAAP Workshop, Shiran-Kaikan Hall Annex, Kyoto, April 21, 2009.
  3. Daisuke Takahashi: A Volumetric 3-D FFT on Clusters of Multi-Core Processors, 2009 SIAM Conference on Computational Science and Engineering, Miami Hilton Downtown, Miami, Florida, USA, March 5, 2009.
  4. Daisuke Takahashi: Automatic Tuning for Parallel FFTs, Second French-Japanese PAAP Workshop, ENSEEIHT-IRIT, Toulouse, France, June 24, 2008.
  5. Daisuke Takahashi: Automatic Tuning for Parallel FFTs, 2008 SIAM Conference on Parallel Processing for Scientific Computing, The Renaissance Atlanta Hotel Downtown, Atlanta, Georgia, USA, March 12, 2008.
  6. Daisuke Takahashi: The FFTE Library and the HPC Challenge (HPCC) Benchmark Suite, First French-Japanese PAAP Workshop, Next-Generation Supercomputer R&D Center, RIKEN, Chiyoda-ku, Tokyo, November 2, 2007.

4. Invited Talk

  1. Daisuke Takahashi: Parallel Implementation of Multiple-Precision Arithmetic and 2.576 Trillion Digits of Pi Calculation on a Massively Parallel Cluster of Multi-Core Processors, Workshop on Ultra Performance and Dependable Acceleration Systems (held in conjunction with PDCAT'09), Gakushi-kaikan, Hiroshima University, Higashi-Hiroshima, December 11, 2009.

5. Book

  1. Daisuke Takahashi: Implementation of Multiple-Precision Parallel Division and Square Root on Distributed-Memory Parallel Computers, Yi Pan and Laurence T. Yang (Eds.): Parallel and Distributed Scientific and Engineering Computing: Practice and Experience, Nova Science Publishers, New York, pp. 35--49 (2004).